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ABSTRACT 


A rudimentary  translator  writing  systea  is  developed 
for  easy  iapleaentation  in  about  2 pages  of  asseably 
language  code.  Although  the  systea  is  based  on  backtrack 
parsing  and  lacks  a scanner,  it  still  perforas  useful 
translations  in  a few  ainutes  of  CPU  tine,  with  storage 
requireaents  of  about  10K  bytes,  for  a typical  translation. 

The  systea  is  based  on  an  ALGOL-like  prograa  by  Michels 
which  translates  source  language  strings  into  target 
language  strings,  according  to  a translation  graaaar  which 
is  specified  using  prefix  Polish  operators.  Fortunately, 
the  user  does  not  need  to  specify  translation  graanars  in 
Polish  notation,  because  Michels  gave  a metagraamdr  which 
translates  graaaars  in  BNP-like  notation  (including  the 
aetagranaar  itself)  into  Polish  strings. 

This  report  shows  how  Michels'  prograa  can  be 
iapleaented  without  the  aid  of  an  ALGOL  compiler,  yjle 
present  a translation  graaaar  for  converting  Michels^ 
prograa  (slightly  rewritten)  into  code  for  a simple, 
special-purpose  interpreter.  Once  this  simple  interpreter 
is  iapleaented,  and  Michels'  prograa  (in  interpreter  code) 
and  the  first  input  graaaar  are  prepared,  a small  translator 
writing  systea  is  complete.  In  this  primitive  system,  a 
translator  "program"  consists  of  the  BNF-like  description  of 
a translation  grammar. 

Michels'  program  was  written  with  the  goal  of 
conceptual  simplicity.  However,  in  actual  performance  it 
was  found  to  be  too  slow  to  be  practical.  Accordingly,  we 
present  a new  prograa  which  is  shorter,  aore  efficient,  and 
which  requires  only  a slightly  more  complex  interpreter. 


Key  words  and  phrases: 

metalanguage,  translator,  syntax-directed  translation, 
translator  writing  systea,  self-describing  graaaar, 
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1.  Introduction 

Programing  languages  should  not  interfere  with  the 
initial  steps  in  designing  programs.  Mirth  [11]  has 
proposed  that  prograas  be  written  using  "stepwise 
ref ineaent”,  in  which  several  versions  of  a prograa  are 
written  in  notations  convenient  to  the  prograaaer,  but 
approach  sone  existing  target  language  as  the  final  step. 
When  writing  nan?  prcgraas  in  a single  doaain  (e.g. 
graphics),  the  prograaaer  aay  find  a particular  interaediate 
language  useful  for  his  needs.  If  the  process  of  writing 
translators  and  interpreters  could  be  siaplified,  then 
special-purpose  interaediate  languages  could  be  iapleaented, 
obviating  the  need  for  the  final  refineaent  steps. 
Extensions  to  existing  extensible  languages  could  accoaplish 
a siailar  effect,  but  would  not  allow  the  freedoa  of 
language  design  available  to  a translator-writer,  and  would 
result  in  a larger,  aore  cuabersoae  language  with  unneeded 
features.  Also,  extensible  languages  are  generally  hard  to 
iapleaent  and  require  large  operating  systeas  [1,7,9,10]. 
In  view  of  the  rapid  proliferation  of  saaller  coaputing 
systeas,  it  would  be  useful  to  have  siaple  aechanisas  for 
iapleaenting  new  special-purpose  languages. 


In  this  report,  we  present  a siaple  iapleaentation  of 
Michels'  translator  writing  systea  [5],  which  is  based  on  an 
earlier  paper  by  Schorre  [0].  The  systea  allows  siaple 
kinds  of  translation,  directed  by  a user-produced  graanar 
which  is  augaented  with  output  syabols.  Michels  devised 
aetagraaaars  in  notation  siailar  to  BMP  [6]  which  define  the 
translation  of  qraaaars  (including  the  aetagraaaars 
theaselves)  into  strings  of  prefix  Polish  operators, 
suitable  for  execution  on  an  interpreter.  Using  ALGOL-like 
autually  recursive  procedures,  Michels  then  presented 
descriptions  of  interpreters  which  would  execute  a prefix 
Polish  operator  string  and  carry  out  translations  according 
to  the  graaaar  represented  by  the  operator  string.  The 
following  diagraa  shows  the  relationship  oetween  a BNP 
aetagraaaar,  a Polish  representation  of  the  saae  graaaar, 
and  an  interpreter. 
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If  an  arbitrary  BNP  translation  graaaar  G is  used  as 
input,  the  resulting  output  will  be  the  Polish  fora  of  the 
graaaar  (call  it  G-0) . If  SEEDO  is  then  executed  with  G-o 
as  its  progran,  any  input  will  be  recognized  and  translated 
according  to  the  rules  of  G.  In  this  systen,  G is  a siaple 
translator  "progran".  The  following  diagran  illustrates  the 
two-step  process  involved. 
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(*  indicates  typical  user-generated  input) 


This  report  extends  Michels'  work  by  actually 
iapleaenting  the  SEEDO  interpreter.  He  do  this  by 
translating  SEEDO  into  code  for  a low-level  interpreter. 
The  low-level  interpreter  is  siaple  enough  to  be  iapleaented 
in  about  2 pages  of  PDP-11  asseably  language  code.  Soae 
utility  routines  and  nachine-readable  texts  aust  also  be 
prepared  before  the  systen  is  coaplete. 

HcKeeaen  [4]  calls  the  systea  a SEED  because  even 
though  it  is  saall  (and  not  very  powerful  and  efficient  by 
itself)  it  can  be  used  as  a tool  to  iapleaent  languages  for 
writing  scanners,  parsers,  and  other  coaponents  of  aore 
sophisticated  translator  writing  systeas.  Unfortunately,  it 
turns  out  that  Michels'  SEEDO  prograa  fails  even  as  a SEED, 
because  it  is  considerably  slower  than  hand-translation  in 
sost  cases.  Accordingly,  we  present  an  iapleaentation  of 
SEEDO  only  to  show  the  essential  ideas  needed  for  a siaple 
translator.  He  then  present  a aore  efficient  prograa, 
SEBDl,  consisting  of  a saaller  set  of  recursive  procedures 
written  in  a slightly  aore  powerful  language.  The  new 
language  reguires  new  operation  codes  in  the  low-level 
interpreter.  However,  fewer  low-level  interpreter 


instructions  are  needed  to  implement  SEED1.  In  addition, 
coaputation  tiae  is  reduced  by  a factor  of  30  for  a typical 
translation . 


2.  Poraal  Definition  of  Translation 

The  aaterial  in  this  section,  as  well  as  in  Section  3, 
is  adapted  fron  Michels  [5] 

A context-free  granaar  G is  a quadruple  (Vt,  Vn,  S,  P) 
where: 

Vt  is  a finite  set  of  syabols  called  TERMINALS. 

Vn  is  a finite  set  of  syabols  called  NON-TERMINALS. 

(Vt  and  Vn  are  disjoint,  and  their  union  is  the  set  V) 
S is  a distinguished  aeaber  of  Vn  called  the  START  or  GOAL 

SYMBOL. 

P is  a finite  set  of  productions  such  that  each  production 
is  a pair  (a,b)  (alternate  notation:  a->b)  . The  LEFT 

PART,  a,  is  a syabol  in  Vn  and  the  RIGHT  PART,  b, 
is  a (possibly  null)  sequence  of  syabols  froa  V. 

The  postfix  operator  * will  denote  the 
set  closure  or  the  set  of  all  sequences 
of  syabols  in  a set.  For  exaaple,  V* 
represents  the  set  of  all  strings  that 
can  be  constructed  fron  the  syabols  in 
the  alphabet,  including  the  eapty 
string.  The  operator  ♦ denotes  the  set 
closure  with  the  exclusion  of  the  eapty 
string. 


The  set  of  productions  define  all  possible  derivations 
in  T.  For  all  (x,y)  in  P and  u,  v in  V*,  u is  derivable 

froa  v (written  v ->*  u ) if  u can  be  created  by 

substituting  y for  any  occurrence  of  x either  in  v,  or  in 
aay  string  derivable  froa  v. 

Any  string  derivable  froa  S is  a SENTENTIAL  FORM.  A 
sentential  fora  not  containing  any  elements  of  Vn  is  a FINAL 
SENTENTIAL  FORM,  or  SENTENCE. 

A TRANSLATION  GRAMMAR  ? is  a quintuple  (Vi,  Vo,  Vn,  S, 

P)  where  Vi  and  Vo  are  disjoint  sets  partitioning  Vt.  He 
call  Vi  the  INPOT  VOCABULARY  and  Vo  the  OUTPUT  VOCABULARY. 

Vn,  S,  and  P are  defined  as  before.  T is  said  to  TRANSLATE 

an  eleaent.  u of  Vi*  into  an  eleaent  v of  Vo*  if  and  only  if 
S •>*  s,  and  deleting  the  synbols  of  Vi  (respectively,  Vo) 
froa  s leaves  v (respectively,  u)  . Observe  that  T aay 
translate  u into  several  strings. 

- ■ i\ 
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Let  Pi  be  P with  the  syebols  froe  Vo  deleted,  and  Po  be 
P with  the  syebols  froe  Vi  deleted.  The  INPUT  LANGUAGE  Li 
of  T is  described  by  the  context-free  graaaar  Gi  3 (Vi,  Vn, 
S,  Pi).  The  OUTPUT  LANGUAGE  Lo  of  T is  described  by  the 
context-free  .graaaar  Go  = (Vo,  Vn,  S,  Po) . 

He  note  in  passing  that  every  translation  granaar  has  a 
■dual'*  translation  graaear  in  which  the  roles  of  Vi  and  Vo 
are  reversed.  Also,  we  could  generalize  translation 
graaaars  to  allow  aore  than  two  partitions  of  Vt. 


2.1  A Translation  Exaaple 

He  present  a graaear  that  will  translate  infix 
expressions  to  prefix  expressions  as  an  exaaple  of  this 
class  of  translations.  Consider  the  translation  graaaar  T 3 
(Vi,  Vo,  Vn,  S,  P')  where: 

Vi  3 ( + ,*,a,b)  This  is  the  alphabet  of  the  input  language. 
Vo  * (i,*,a,b)  This  is  the  object  language  alphabet;  in 
this  case  it  corresponds  one-for-one  with 
Vi.  The  syabols  are  underlined  to  differen- 
tiate the  two  sets. 

Vn  3 (S,T,F,I)  These  are  the  non-terainal  syabols. 

P»  3 {S->T,  T->*P*T,  T- >F,  F->*I*F,  F->I,  I->aa,  I->bb) 

This  is  the  set  of  productions  defining  the 
translation. 


T specifies,  aaong  other  things,  the  sapping  of  'a+b*a' 
to  **a*ba *.  The  full  derivation  is  as  follows: 

§£ai£JHill-l2£«  Ilinsi tional_Puie 

S Start  syabol 

T S->T 

♦F  VT  T->*F*T 

♦ P 4P  T->F 

♦F  vOI  *F  F->*I*F 

♦ I **1  *1  F- > I (used  twice) 

»j)a»*bb*aa  I->aa  (twice)  ,I->bb  (final 

sentential  fora) 

a*  b*  a Input  Sentence  (syabols  froa  Vo 

deleted) 

£ Output  Sentence  (syabols  froa  Vi 

deleted) 


3.  Translator  I apleaentation 

To  iapleaent  a translator  based  on  a translation 
graaaar,  we  aust  create  a parser  for  the  input  language. 
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Our  assumption  here  is  that  the  easiest  parsing  scheae  to 
isplesent  is  the  top-down  backtracking  approach. 
Conceivably,  Earley's  algorithm  [2]  or  some  other  aethod 
could  also  be  implemented  concisely,  but  this  matter  is 
beyond  the  scope  of  this  paper.  The  restrictions  given  in 
this  section  apply  to  our  method  of  backtrack  parsing. 

To  specify  the  restrictions,  we  will  group  together  all 
productions  having  the  same  left  part.  To  this  end,  we  use 
Pi,  the  set  of  input  productions,  to  construct  the  set  PLi. 
Each  element  of  PLi  is  a list  (a,b1 ,b2,. . . ,bn) ; a is  some 
left  part,  and  all  bi  are  corresponding  right  parts.  That 
is,  all  bi  are  included  in  an  element  of  PLi  if  and  only  if 
a->bi  is  an  element  of  Pi.  An  input  grammar  will  be 
represented  by  a description  of  PLi,  such  that  the  first 
element  is  the  list  in  which  a is  S,  the  start  symbol.  A 
translation  grammar  is  represented  by  PL,  which  is  PLi  with 
the  output  symbols  restored. 

An  ordering  on  the  alternative  right  parts  in  each 
element  PLi  is  defined  to  guarantee  that  if  two  right  parts 
can  generate  input  sentences  such  that  one  input  sentence  is 
a prefix  of  the  other,  then  the  right  part  generating  the 
longer  input  sentence  is  listed  first.  Formally,  for  all  p 
in  PLi,  if  b and  b*  are  right  parts  in  p and  b precedes  b', 
then  for  all  u,  u'  elements  of  Vi*  such  that  b =>*  u and 
b*  =>*  u*,  there  exists  no  u'  * ur,  where  r is  in  Vi*. 

No  productions  may  be  empty.  That  is,  for  all  a->b 
in  Pi,  b is  in  Vi*. 

Grammars  may  not  allow  left  recursion.  That  is,  there 
may  be  no  v in  Vn  and  u in  V*  such  that  v =>*  vu  . 

Deterministic  lef t-to-right  parsing  is  simplified  if, 
whenever  a prefix  of  the  remaining  input  is  to  be  derived 
from  a nonterminal  a,  the  prefix  yielding  a correct  parse  is 
the  longest  prefix  derivable  from  m.  That  is,  there  should 
not  exist  any  sequence  umcv  derivable  from  S,  and  both  x and 
xc  derivable  from  m,  where  S is  the  start  symbol,  u ar.d  v 
are  in  (Vi  U Vn)*,  a is  in  Vn,  and  x and  c are  in  Vi*. 

The  translators  to  be  described  in  this  paper  do  not 
detect  violations  of  the  restrictions  given  here;  they  will 
merely  produce  incorrect  parses,  or  fail  to  terminate  at 
all. 


3.1  A Simple  Translation  Language 

A metalanguage  can  be  defined  for  the  syntax  and 
translation  of  a translation  grammar.  The  notation  is 
similar  to  BNP  [6].  For  each  non-terminal  which  is  a left 
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part  of  a production,  all  right  parts  for  that  non"terninal 
are  listed,  separated  by  a slash  (/) . The  left  part  will  be 
separated  froa  the  alternative  right  parts  by  an  equal  sign 
(*) . Juxtaposition  will  denote  the  concatenation  of 
elements  of  . a right  part.  Double  quotes  (")  will  deliait 
eleaents  in  Vi*.  Brackets  ([ , ])  will  be  used  to  deliait 
eleaents  of  Vo*.  Single  letters  denote  eleaents  of  Vn . 
Normal  parentheses  can  be  used  to  alter  the  iaplied  operator 
precedence  and  to  reduce  the  nuaber  of  productions  required 
by  allowing  the  factoring  of  rules. 

Pig.  1 shows  G 1 , the  translation  graaaar  describing 
the  "graaaar  language",  expressed  in  its  own  language.  It 
is  a aodification  of  the  graaaar  given  in  Section  4. 1 of 
Hichels  [5].  (Productions  I,  S,  and  0 were  modified  so  as 
to  ainiaize  backtracking  in  aost  cases,  and  L was  split  into 
L and  T,  with  aany  new  syabols  added) . Multiple  blanks  and 
ends  of  lines  have  no  aeaning  in  the  language.  They  have 
been  used  here  to  iaprove  readability  and  should  be  ignored. 

Literal  strings  aay  be  of  arbitrary  length.  This 
creates  a problem  if  a string  aust  contain  a double  quote 
(")  , which  is  the  literal  deliaiter.  To  solve  this  the 
production  for  "I"  (see  Pig.  1)  is  ordered  to  test  for  a 
double  quote  as  the  first  character  of  a literal  string;  if 
one  is  found  it  is  assuaed  to  be  the  entire  string  and  oust 
be  followed  by  the  terainating  double  quote.  A double  quote 
in  any  other  position  of  a literal  string  is  assuaed  to  be 
the  terainating  delimiter  of  that  string.  A siailar 
convention  is  used  to  denote  right  bracket  ( ])  as  an  output 
syabol. 

The  self "translation  of  G1 , giving  the  "object  granmar" 
G1-0,  is  shown  in  Fig.  2.  Paragraphing  has  been  added  to 
iaprove  readability.  The  actual  machine  language,  as 
defined  by  the  translator  and  accepted  by  the  aachine,  would 
be  a continuous  string  of  characters.  The  only  significant 
blank  is  one  which  follows  an  odd  number  of  sharps  (#) . The 
next  section  explains  the  meaning  of  t. 


3.2  The  Object  Language 

The  output  of  the  aetatranslator  described  in  Section 
3.1  is  a description  of  a translation  graaaar.  This  output 
can  be  interpreted  by  the  prograa  to  be  described  in  the 
next  section,  and  can  be  thought  of  as  an  "object  language 
for  graaaars".  This  language  contains  five  prefix 
operators.  The  'G*  is  a binary  concatenation  operator;  it 
has  the  vaXue  TRUE  if  and  only  if  both  of  the  operands  which 
follow  it  are  TFOE . The  «/*  is  the  binary  alternation 
operator,  which  has  the  value  TRUE  if  either  of  the  operands 
following  it  are  TRUE.  If  the  first  is  TRUE  the  second  is 
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not  tested.  If  the  first  is  FALSE,  both  input  and  output 
strings  are  restored  to  their  pre-test  value  before  testing 
the  second.  The  is  a unary  non-tec ninal  operator;  it 
has  the  value  TRUE  if  the  rule  labeled  by  its  operand  is 
TRUE.  The  •#.'  is  a unary  terminal  operator;  it  has  the 
value  TRUE  if  the  current  character  of  input  is  the  sane  as 
the  operand,  in  which  case  the  input  is  advanced  one 
character.  The  •>•  is  a unary  operator  and  always  has  the 
value  TRUE.  The  character  following  it  is  appended  to  the 
end  of  the  current  output  string. 


3.3  The  Translator  Interpreter 

Me  now  define  an  interpreter  to  execute  the  object  code 
eaitted  by  the  translator  of  Section  3.1.  The  interpreter 
is  presented  as  a set  of  nutually  recursive  functions  in  the 
ALGOL-like  language  SEEDGOL-O , to  be  described  here  and  in 
Section  4. 

SEEDGOL-0  has  three  types  of  data:  strings,  single 
characters,  and  boolean  values.  All  strings  ace  substrings 
(in  fact,  "tails’*)  of  either  the  object  graaaar  G,  or  the 
source  input  I,  both  of  which  are  inputs  to  any  SEEDGOL-O 
progras.  Me  refer  to  the  sets  of  tails  of  G and  I as 
STRINGS(G)  and  STRINGS(I),  respectively. 

The  built-in  functions  of  SEEDGOL-O  ace  as  follows: 


First:  STRINGS  ->  CHARACTERS 

First  (S)  is  the  first  character  of  the  string  S. 

Rest:  STRINGS  ->  STRINGS 

Rest (S)  is  all  but  the  first  character  of  S. 

Output:  CHARACTERS  -> 

Output  (C)  has  no  value,  but  causes  the  side  effect 
of  sending  C to  an  output  device  or  buffer. 

Equal:  CHARACTERS  X CHARACTERS  ->  BOOLEANS 

Equal  (Cl, C2)  is  TRUE  if  and  only  if  Cl  and  C2 
are  identical  characters. 

Isnulli:  STRINGS  (I)  ->  EOOLEANS 

Isnulli(S)  is  TRUE  if  and  only  if  S is  null. 

The  built-in  operation  of  pairing  any  two  expressions 
is  also  allowed,  by  using  parentheses.  The  constants  TRUE 
and  FALSE  ace  included  in  the  language. 

He  now  give  the  functionality  and  leaning  of  the 
interpreter  definition  functions,  implemented  in  the 

SEEDGOL-0  prograi  SEEDO  (shown  in  Fig.  3). 
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IP  TEST(REST(GP)#IP)  then 

If  ISNULLI(REMAININ6(REST(GP)# IP) ) THEN 
(TRUE*  EMIT(REST(GP>*  IP)) 

ELSE  tFALSE*  NULL) 

ELSE  (FALSE*  NULL) 

I 

DEFINE  TEST(RP#  IP)  ■ 

IF  E0UAL<FIR3T(RP),  "I")  THEN  TEST(EIN0(GP#  REST(RP))#  IP) 
ELSE  IF  ,E0UAL(FIRST(RP>*  "I")  THEN 
IF  TEST(REST(RP>#  IP)  then 

TEST(SkIP(RE3T(RP),  IP)#  REMAINING(REST(RP)#  IP)) 
ELSE  FALSE 

ELSE  IF  EQUAL(FIRST(RP)#  "/")  THEN 
IF  TEST(REST(RP)»  IP)  THEN  TRUE 
ELSE  TEST(SKIP(REST(RP)#  IP)#  IP) 

ELSE  IF  EQUALtFIRST(RP)#  ">")  THEN  TRUE 
ELSE  EQUAL(FIRST(REST(RP))#  FIRST<IP))I 


DEFINE  REMAINING(RP#  IP)  ■ 

IF  EOUAl(FIRST(RP)»  "I")  THEN  REHAINING(FIN0(GP#  REST(RP))*  IP) 
ELSE  IF  EOUALIFIRST(RP)#  "»")  THEN 

REMAINING(SK!P(REST(RP)#  IP)#  REMAINING(REST(RP)#  IP)) 

ELSE  IF  EOUAL(FIRST(RP)#  V")  THEN 

IF  TE5T(REST(RP)#  IP)  THEN  REM A I N ING ( RE  ST ( RP ) » IP) 

ELSE  REMAIN!NG(SK!P(ReST(RP)#  IP)#  IP) 

ELSE  IF  EQUAL(FIRST(RP)»  ">")  THEN  IP 
ELSE  REST( IP ) I 


THEN 

"«") 


DEFINE  EMIT(RP#  IP)  • 

IF  EQUAL(FIRST(RP)#  "I") 

ELSE  IF  EQUAl(F!RST(RP)# 
(EMIT(REST(RP)#  IP)# 

EMIT(SKIP(REST(RP)» 
ELSE  IF  EOUALtFIRST(RP).  "/«) 
IF  TEST(REST(RP)»  IP)  THEN 
ELSE 

ELSE  IF  EOUAl(FIRST(RP)# 

ELSE  NULL) 


EMIT(FINO(GP# 

THEN 


REST(RP))#  IP) 


IP)#  REMAINING(REST(RP)»  IP))) 
THEN 

EMIT (REST(RP)#  IP) 
EMIT(SKIP(REST(RP)#  IP)#  IP) 

">")  THEN  OUTPUT(FIRST(REST(RP))) 


DEFINE  SKIP(RP#  IP)  « 

IF  EOUAL(FIRST(RP)#  "«") 
ELSE  IF  EOUAL(FIRST(RP)# 
ELSE  REST(REST<RP)>) 


then  SKIP(SKIP(REST(RP)#  IP)* 
"/«)  THEN  SKIP(SKIP(REST(RP)» 


IP) 

IP)#  IP) 


OEFINE  FIND(RP#  IP)  ■ 

IF  EOUALCFIRST(RP),  fIRST(Ip))  then  REST(RP) 
ELSE  FIND(SKIP(REST(RP)#  IP)#  IP)) 

ENO 


SEEOO 
FIGURE  3 


SEEDO:  STRINGS  (G)  X STRINGS  (I)  •>  BOOLEANS 

SEE  DO  (G,I)  is  TRUE  if  and  only  if  the  input 
I is  recognized  by  the  string  of  prefix  operators 
G.  As  a side  effect,  SEEDO  causes  the  proper 
characters  to  be  output. 

Test:  STRINGS (G)  x STRINGS  (I)  ->  BOOLEANS 

Test (RULE, INPUT)  is  TRUE  if  any  left-nost  substring 
of  INPUT  natched  by  RULE. 

Reaaining:  STRINGS  (G)  x STRINGS  (I)  ->  STRINGS  (I) 

Renaining  (RULE, INPUT)  is  the  substring  of  INPUT 
reaaining  after  the  substring  recognized  by 
RULE  is  removed. 

Eait:  STRINGS  (G)  x STRINGS (I)  -> 

Bait (RULE, INPUT)  has  no  value,  but  causes  output 
characters  to  be  sent  to  an  output  device  or 
buffer  while  INPUT  is  recognized  by  RULE. 


Skip:  STRINGS  (G)  x STRINGS  (I)  ->  STRINGS(G) 

Skip(RULE, INPUT)  is  the  substring  of  RULE 
reaaining  after  the  leftmost  operator  and 
its  operands  have  been  removed.  INPUT  has  no 
bearing  on  the  computation,  but  is  required 
by  syntax. 

Find:  STRINGS  (G)  x STRINGS  (G)  ->  STRINGS  (G) 

Find  (GRAMMAR, STRING)  is  the  substring  of  GRAMMAR 
labeled  by  the  first  character  of  STRING. 


4.  Implementing  SEEDGOL-O 

To  iapleaent  the  S EEDGOL-0  language,  we  will  produce  a 
translation  grammar  for  converting  SEEDGOL-0  programs  into 
interaediate  code,  and  we  will  produce  an  interpreter  for 
executing  the  code.  Me  will  refer  to  this  low-level 
interpreter,  MO,  as  a ''machine",  to  avoid  confusing  it  with 
the  SEEDO  program,  an  interpreter  for  object  graaaars. 

The  SEEDGOL-0  language  will  be  very  specialized, 
containing  aerely  the  constructs  needed  to  iapleaent  the 
SEEDO  interpreter  in  Fig.  3.  The  following  restrictions 
apply  to  the  SEEDGOL-O  language: 


1.  All  blanks  ace  ignored.  In  general,  a 
reserved  word  should  not  be  a prefix  of  an 
identifier  or  another  reserved  word. 


2.  There  are  no  declarations  other  than 
procedure  declarations.  The  only  variables  are 
the  paraeeters  BP  and  IP,  which  are  strings.  At 
the  outeraost  level,  IP  is  predefined  to  be  all  of 
the  input  string,  and  RP  is  predefined  to  be  sane 
as  CP.  GP  is  a constant:  the  object  graaaar 
string,  a sequence  of  prefix  operators. 


3.  All  procedures  have  exactly  2 paraoeters. 
Pairs  and  procedure  argunents  are  evaluated 
left-to-right. 


4.  Procedures  and  the  aainline  each  consist 
of  one  expression. 


5.  The  language  has  no  I/O  facility  other 
than  the  built-in  "Output"  function,  which  is 
ispleaentation  dependent.  Initialization  of  the 
input  and  graaaar  string  storage  areas  is  also 
ispleaentation  dependent,  and  is  assuaed  to  be 
coapleted  before  the  SEEDGOL-0  aachine  begins 
execution. 


4.1  Translating  SEFDGOL-0  into  Object  Code 

Pig.  4-A  shows  SGLOG  (SEEDGOL-J  Graaaar) , a graaaar 
for  describing  SEEDGOL-0  and  for  translating  it  into  a 
aneaonic  fora  cf  aachine  language  of  MO,  the  SEEDGOL-O 
aachine  (to  be  described  in  the  next  section) . The  output 
strings  of  SGLOG  are  either  instructions,  characters,  or 
nuaeric  constants.  Fig.  4-B  shows  the  sane  SGLOG  graaaar, 
with  outputs  represented  in  a fora  which  can  be  aore  easily 
interpreted  by  a conputer  prograa.  The  occurrence  of  "I" 
aeans  that  the  following  base-10  nunber  will  represent  an 
instruction,  while  a double  quote  (")  aeans  that  the  next 
character  is  a character  constant.  A "D"  precedes  a base-IC 
nuaerical  constant.  Pig.  5-B  shows  SEEDO-O,  which  is  the 
result  of  translating  SEEDC  (Pig.  3)  via  the  translation 
graaaar  SGLOG  (Pig.  4-B).  SEEDO-0  is  a fora  of  the  "object 
code"  for  SEEDO  on  MO.  A aneaonic  version  of  SEEDO-O  is 
given  in  Pig.  5-A. 
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8X1 1*13X5X91 101  lOX  101  10t13"S',K"I’*P’’  1 1 1 3"tI7I8I 1 1"S"K«I«P"  1 1 1"S"K" i"P" 
111*121 12X21 12X911 1 3"/17I8lll"S"K*,I"P«  I11«S’’K«I"P"  1 1 1*  121 12 121 12I91 1 1* 
I*XlOIlOIl3*’r’’I’’N’’0”  Xl!3l2l3l7l8IlI*l9Ill«rwI,,N"D"  1 1 l"3"K" I"P"  1 1 1*  12 1 
12X2X12110X13 
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iTf  n~rrrr 


1500  bytes  (8  bits/byte) 


(output  string) 


(buffer) 


optional 


LEN  0 I 1 I word  (16  bits/word) 


(no.  of  characters  in  0) 

n 


•=  rriTHTm 


j 900  bytes 


(object  grammar  string)  (fixed) 


I:  I I j II  11  1 11  INI  1500  bytes 


(source  input  string)  (fixed) 


LEN  I | 1 I word 


(no.  of  characters  in  I ) 


0 

rp|adr 

0 

Rpj  (ADR 

0MJ 

2500  words 


(parameter  stack) 


(stack) 


E: 


n 


j 40  words 


(evaluation  stack)  (stack) 

\rn  r — : 


I I I I I ~1  700  bytes 


(machine  code) 


(fixed) 


PC  f 1 


REG  I I 1 


(program  counter)  (word) 


(word) 


REG  2 1 j 


(word) 


M0,  The  SEEDGOL-0  Machine 
(with  typical  buffer  sizes  shown) 

Fig.  6 


scratch 

registers 


CONSTANT  OATA  ITEMS  ARE  DISTINGUISHABLE  FROM  INSTRUCTIONS. 
THE  SET  "ITEM"  IS  THE  UNION  OF  CHAR#  STRING#  AND  BOOLEAN. 

MO  MACHINE  INSTRUCTIONS 

FIGURE  7 


OPCODE 

mnemonic 

20  1 

OCSeUPTlON 

(ANY 

0 

OATA  ITEM) 
GR 

(ITFM)  RUSH  THE  ITEM, 

(STRING)  RUSH  pointer  to  first  character  in  g. 

1 

RR 

(STRING)  Push  rp  parameter  (STORED  at  the 
top  of  the  p stack). 

2 

IR 

• (STRING)  RUSH  IP  PARAMETER  (ST0RE0  NEXT  TO 
THE  TOR  OF  THE  P STACK), 

I 

FIRST 

(STRING  •>  CHAR)  REPLACE  TOP  OF  E MlTH  THE 

character  to  which  it  points. 

A 

REST 

(STRING  ->  STRING)  INCREMENT  PTR  AT  TOP  OF  E. 

S 

OUTPUT 

(CHAR  ->)  POP  TOP  OF  E TO  OUTPUT. 

6 

ISNULLI 

(STRING(I)  •>  BOOLEAN)  REPLACE  TOP  OF  E WITH 
TRUE  IF  IT  POINTS  TO  END  OF  I#  ELSE  F ALSF . 

7 

equal 

(ITEM  x ITEM  •>  BOOLEAN)  REPLACE  THE  TOP  2 

items  of  e with  true  if  equal#  else  false. 

8 

IF 

(ITEM  ->  ) POP  E,  IF  FALSE#  SET  PC  TO  LOC  OF 
NEXT  MATCHING  "THEN'*  ♦ 1. 

9 

THEN 

( ) SKIP  TO  NEXT  MATCHING  "ELSE"  ♦ 1. 

|0 

ELSE 

( ) NO  OPERATION)  SERVES  AS  MARKER  ONLY, 

li 

RROCN 

(ADDR)  compare  the  characters  in  m at  pc 
with  all  strings  FOLLOWING  return  instr-s 

ANYWHERE  in  M.  (BLANK  IS  ALWAYS  THE  FINAL 
CHARACTER.)  PUSH  THE  LOCATION  IMMEDIATELY 
AFTER  THE  MATCHING  STRING  (THIS 
IS  THE  CALLED  ADDRESS). 

12 

CALL 

( ADDR  * STRING  * STRING  ->  ) PUSH  PC  (RETURN 
ADDRESS)  ONTO  P#  THEN  PUSH  POP(E)  ONTO  P 

twice#  thereby  passing  the  ip  and  rp 
parameters,  then  set  pc  ■ pop(E)#  the  calleo 

ADORESS  (COMPUTED  AND  STACKED  BY  AN  EARLIER 
PROCN  INSTRUCTION), 

11 

RETURN 

( ) POP  p TWICE.  IF  EMPTY#  SET  HALT 

CONDITION#  ELSE  SET  PC  ■ POP(P)#  RETURNING. 

AIL 

RUSHES  ANO 

POPS  REFER  TO  THE  E STACK#  EXCEPT  AS  NOTED. 
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0.2  The  SEEDGOL-O  Machine 

A diagram  of  the  S EEDGOL-0  machine,  MO,  is  given  in 
Fig.  6.  MO  contains  two  string  storage  areas~one  for  the 
object  grammar  input  (G) , and  one  for  the  input  string  to  be 
recognized  and  translated  (I).  output  is  presented  serially 
to  a device  or  to  a storage  buffer.  There  is  an  evaluation 
stack,  E,  which  holds  all  operands.  The  machine  code 
contains  operators  which  affect  the  contents  of  E . Since 
SEEDGOL-O  allows  recursion,  there  is  a parameter  stack,  p, 
which  contains,  for  each  call  of  a procedure,  two  parameters 
(called  RP  and  IP),  and  a return  address.  The  lowest  level 
of  P corresponds  to  the  aainline,  for  which  there  is  no 
return  address.  The  machine  code  is  stored  in  an  area 
called  H,  with  PC  pointing  to  the  next  instruction  to  be 
executed.  M contains  both  instructions  and  constant  data. 
The  contents  of  the  areas  G,  I,  and  M do  not  change  during 
execution;  the  contents  of  the  stacks  E , P,  and  0,  as  well 
as  the  temporary  registers  REG1  and  REG2  and  the  program 
counter  PC,  do  ordinarily  change  during  execution. 


The  only  strings  which  can  be  referred  to  by  variables 
in  a SEEDGOL-O  program  are  tails  of  two  fixed  strings:  the 
grammar  (G)  and  the  inpu*  (I).  As  a result,  string 
variables  can  be  represented  by  pointers  to  locations  in  G 
or  I.  The  string  represented  by  a pointer  consists  of  all 
characters  in  G or  I beginning  with  the  character  pointed 
to. 


In  the  machine  code  storage  area  M,  constants  may  have 
any  representation  which  is  easily  distinguished  from  the 
instructions.  As  a matter  of  convenience,  the 
implementation  in  this  report  uses  8-bit  representations  of 
instructions  and  constants,  with  the  high-order  bit  set  to  0 
for  constants  and  1 for  instructions.  The  seven  remaining 
bits  are  sufficient  to  represent  the  14  instruction  codes 
and  34  constants  which  can  be  produced  by  SGLOG.  Included 
among  the  constants  are  the  booleans  FALSE  (represented  here 
by  0),  TRUE  (1),  and  alphabetic  and  other  characters 
(represented  by  ASCII  or  some  other  7-bit  code) . The 
representation  of  constants  is  specified  by  the  grammar 
SGLOG  (Fig.  4-B) . A description  of  the  machine  code 
operators  is  given  in  Fig.  7.  All  unary  and  binary 
operators  expect  their  operands  on  E.  Most  instructions  are 
quite  simple.  However,  IF,  THEN,  ELSE,  PROCN,  CALL,  and 
RETORV  are  interesting  enough  to  warrant  examples. 

He  first  describe  an  example  ot  an  1F-THEN-ELSF 
expression.  Referring  to  the  grammar  SGLOG  in  Pig.  4-B, 
one  can  see  that  the  IF  expression 
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IP  (expl)  THEN  (exp2)  ELSE  (exp3) 
will  be  translated  into 

(expl*)  IP  (exp2«)  THEN  (ezp3*)  ELSE, 

where  the  starred  expressions  are  the  machine  code  versions 
of  the  source  expressions,  and  capitalized  words  in  the 
second  line  denote  aachine  instructions.  Noraally,  (expl*) 
causes  a single  boolean  value  to  be  pushed  onto  the 
evaluation  stack,  E.  (A  pathological  SEEDGOL-0  program 
could  have  NOLL  or  an  ordered  pair  as  (expl) , thereby 
causing  zero  or  two  paraaeters  to  be  pushed  on  E.  This 
would  cause  the  aachine  to  fail.)  The  IP  operator  pops  this 
value  off  E,  and  transfers  prograa  control  to  the  next 
aatching  THEN  ♦ 1 if  the  value  is  FALSE.  Otherwise, 
execution  continues  at  the  next  instruction  (i.e.,  exp2*  is 
executed) . The  "then"  part  of  the  if  expression  is  exp2*, 
delimited  by  the  IF  and  THEN  instructions.  The  "else"  part 
of  the  if  expression  is  exp3*,  deliaited  by  THEN  and  ELSE. 
Nhen  a THEN  is  executed,  the  prograa  counter  is 
unconditionally  advanced  to  the  next  matching  ELSE  ♦ 1,  and 
an  ELSE,  when  executed,  is  a no-op.  ELSE  is  merely  a aarker 
used  by  the  THEN  instruction. 

He  now  consider  the  mechanics  of  procedure  calls  in  HO. 
Suppose  that  the  SEEDGOL-O  source  specifies  the  invocation 
of  the  procedure  JUNK,  as  follows: 


JUNK(exp1,  exp2) 

This  will  be  translated  into 

PROCN  JUNK  (expl*)  (e xp2*)  CALL  . 

Hhen  the  PROCN  instruction  is  executed,  the  entire  aachine 
code  prograa  is  searched  for  an  occurrence  of  a RETURN 
instruction,  followed  by  the  characters  JUNK.  (Since 
all  procedures  must  follow  other  procedures  or  the  aainline, 
and  all  procedures  have  the  characters  of  their  name  first, 
we  see  that  the  only  place  to  look  for  procedure  naaes  as 
destinations  is  iaaediately  following  any  RETURN 

instruction.)  PROCN  thus  deteraines  the  called  address,  and 
pushes  it  on  E.  Next,  two  expressions,  the  actual 
paraaeters,  cause  two  values  to  be  pushed  on  E.  CALL  then 
pushes  three  iteas  onto  P:  the  contents  of  PC  (i.e.  the 

return  address),  and  the  actual  values  of  the  two  paraaeters 
(these  values  are  popped  froa  ?;  their  order  on  P is 
reversed)..  Finally,  CALL  transfers  control  to  the  called 
procedure  by  popping  E once  aore  and  storing  this  value  (the 
called  address)  in  PC.  At  the  end  of  a procedure,  a RETURN 
instruction  is  executed.  It  pops  the  top  two  parameters  and 
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the  return  address  fron  p,  transferring  control  to  the 
return  address. 

B5700  Extended  ALGOL  and  PDP-1 1 assembly  language 
descriptions  of  HO  are  given  in  Appendices  A and  B. 


4.3  Initial  Bootstrapping 

To  create  a siaple  translator-writ ing  systen,  the  user 

■ust: 

1.  Transcribe  G1-0  (Pig.  2),  the  "object" 
grauear  grannar  into  machine  readable 
f oru. 

2.  Transcribe  SEEDO-O  (Fig.  5-B) , the 
object  code  for  SEEDO  on  MO,  into 
■achine  readable  fore. 

3.  Iepleeent  HO  itself  (Appendices  A and  B 
are  exanples)  , 

4.  Iepleeent  a Beans  for  getting  prefix 

Polish  "object"  granaars,  HO 

instructions,  and  input  strings  into  the 
storage  areas  for  HO,  and  a leans  to 
retrieve  the  output  string  after 
translation. 

Once  these  four  steps  are  complete,  a graeaar  subeitted 
as  input  will  be  translated  into  its  object  fora,  and  this 
object  graeear  can  replace  G1-0  to  give  a new  translator. 
This  process  is  represented  by  the  second  diagraa  in  Section 
1.  (One  can  also  describe  such  a series  of  translations  in 
a functional  notation  similar  to  GENESIS  [3],  a language 
used  to  describe  sequences  of  program  runs.) 


5.  An  Improved  Interpreter 

It  was  originally  hoped  that  the  SEEDO  prograa, 
although  clearly  inefficient,  would  still  be  able  to 
translate  graaaars  in  a reasonable  amount  of  tiae.  It  was 
to  be  used  only  for  a few  simple  translations  before  being 
replaced  by  a aore  efficient  translator.  However,  we 
estimate  that  SEEDO  running  on  the  PDP-11  assembly  language 
version  of  HO  will  take  6 1/2  hours  to  translate  G1  to  Gl-O. 
Hand  translaticn  would  be  faster.  The  reason  that  SEEDO  and 
HO  have  beer  presented  at  all  is  that  they  provide  a 
conceptually  siaple  description  of  a basic  translator 
writing  systen.  He  shall  see  that  a practical  TVS  can  be 
obtained  by  extending  this  siaple  one. 


5.1  An  Analysis  of  SEEDO  Execution  Tine 

A casual  study  of  the  SEEDO  program  (Fig.  3)  shows 
great  inefficiencies.  The  primary  observation  is  that  once 
Test  (RP, IP)  is  computed,  not  only  is  a prefix  of  IP  accepted 
or  rejected,  but  the  length  of  the  prefix  is  known,  and  the 
output  symbols  encountered  in  PP  during  the  recognition  of 
IP  are  also  known.  If  we  could  make  this  information 
available  to  the  mainline  program  on  the  outermost  call  of 
TEST,  we  could  reduce  computation  time  by  a factor  of  3, 
since  Remaining  and  Emit,  both  as  time-consuming  as  Test, 
would  not  need  to  be  invoked.  (Note  that  Emit  would  then 
become  totally  unnecessary.) 

More  significantly,  we  could  eliminate  the  call  of 
Remaining  from  within  Test.  This  occurs  in  the  following 
context  within  Test: 

Test(RP,IP)  = ... 

...  IF  Equal  (First  (PP)  ,"&")  THEN 

Test  (Skip  (Rest  (RP)  , IP)  , Remaining  (Rest  (RP)  , IP)) 
ELSE  FALSE 


Suppose  the  G and  I strings  are  as  follows: 

...  & (A)  (B)  ...  . . .aaaaaaabbbbbbbb. . . 

G I 

(A)  and  (B)  are  strings  which  comprise  the  two  operands  of 
&.  Suppose  RP  begins  with  the  &.  Then  Rest  (RP)  begins  at 
the  head  of  the  substring  (A),  while  Skip  (Rest  (RP)  , IP) 
begins  at  the  head  of  the  substring  (B) . Me  first  use 
Rest(RP)  to  recognize  a prefix  of  IP  (the  a's  above,  say), 
and  if  successful,  we  use  Skip  (Rest (RP) , IP)  to  recognize 
part  of  the  remainder  of  IP  (represented  by  the  b's  above) . 
Remain ing  (Rest  (RP)  , IP)  calculates  this  remainder;  we  do  not 
retain  the  information  about  the  length  of  the  prefix  of  IP 
(the  a's  above)  recognized  during  the  call  of 
Test  (Rest  (RP)  ,IP)  . 

Let  n be  the  level  of  recursive  calls  of  Test  with 
First(BP)  = (i.e.  n equals  the  depth  of  recursion  in 

Test,  minus  those  calls  which  do  not  have  First  (RP)  = "&") . 

Since  Remaining  can  call  Test  (if  First  (RP)  = "/")  with 
about  the  sane  likelihood  that  Test  can  call  Remaining,  let 
us  assume  that  all  Remaining  and  Test  calls  made  at  the  same 
n-level  require  the  same  amount  of  time,  T(n-1).  Then  we 
estimate  that: 


T(n)  * 3T(n-1),  i.e.  T (n)  = 3**n,  if  T (3 ) = 1 
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Bat  if  Test*s  call  to  Regaining  were  reaoved,  then: 

T(n)  = 2T  (n-1)  , i.e.  T(n)  = 2**n,  if  T (0)  = 1. 

Even  though,  the  majority  of  calls  of  Test  and  Renaining 
would  not  take  advantage  of  this  savings,  it  is  still 
significant,  because  it  grows  exponentially  with  the  depth 
of  recursion,  which  can  get  quite  large  (e.g.  about  200  for 
G 1,  with  n approaching  30  or  so) , suggesting  that  a 
substantial  speedup  is  possible.  A new  program  could  carry 
out  a large  part  of  the  computation  done  by  SEEDO  in  roughly 
(2/3)  **n  the  time. 


5.2  SEED1,  A New  Interpreter 

We  now  present  a new  Polish  grammar  interpreter,  SEED1, 
which  is  more  efficient  than  SEEDO.  First,  there  are 
several  new  primitive  functions  needed: 

save  : BOOLEANS  X STRINGS  (I)  x STRINGS(O)  -> 

BOOLEANS  X STPINGS(I)  X STRINGS (0) 

Save  is  the  identity  function,  but  it  causes  a 
triple  to  be  stored  in  a special  temporary 
location  local  to  the  procedure  being  executed, 
btenp  : BOOLEANS 
Itemp  : STRINGS  (I) 
otemp  : STRINGS  (0) 

These  three  constants  recover  the  temporary  values 
stored  by  Save. 

Boolean  : BOOLEANS  x STRINGS  (I)  x STRINGS  (0)  ->  BOOLEANS 

This  function  merely  extracts  the  boolean  component  of 
a triple. 

The  following  built-in  function  has  been  modified: 

Output  : STRINGS  (0)  X CHARACTERS  ->  STRINGS  (0) 

Output  (S,C)  appends  C to  S and  returns  the  new  S. 

The  SEED  1 program  (Fig.  8)  is  written  in  the  SEEDG0L-1 
language,  which  is  an  extension  of  SEEDGOL-0  and  contains 
the  new  primitive  functions  described  in  this  section,  as 
well  as  CASE  expressions  and  non-empty  tuples  of  arbitrary 
length.  In  the  new  interpreter,  SEED  1,  we  no  longer  find 
the  procedures  Remaining  or  Emit.  Skip  and  Find  are  just  as 
in  SEEDO,  except  that  a third  parameter,  which  is  not  used 
by  either  function,  is  required  by  the  syntax  of  the 
language. 


26 


IF  BOOLEAN($AvE(TEST(REsT(GP)#  Ip»  OP)))  THEN 
If  ISNULLIC JTEMp)  THpN  (TRUp#  OTeMP) 

ELSE  (FALSE#  NULL) 

ELSE  {FALSE#  NULL) 

I 

DEFINE  TESTtRP#  IP#  OP)  ■ 

CAsE  FIRST(RP)  OF 

"I"  I TEST(FINO(GP#  REST(RP)#  OP)*  IP*  OP) 

«*"  I IF  BOOLEAN(SAVE(TEST(PEST(RP)*  IP#  OP)))  THEN 

IF  BOOLEAN SAVE( TEST(SKIP( REST (RP)#  IP# OP >*  iTEMP*  OTEMP))) 
THEN  (BTfMP,  ITFMP#  OTEMP) 

ELSE  (FALSE#  IP#  OP) 

ELSE  (FALSE#  IP#  OP) 

I IF  BnOLEAN(SAVE(TEST(REST(RP)»  IP#  OP))) 

THEN  (BTEMP,  ITEMP#  OTEMP) 

ELSE  TEST(SKIP(REST(RP)#  IP#  OP)»  IP#  OP) 

I (TRUE#  IP#  CUTPUTCOP#  F IR  ST  ( RE  ST  ( RP  ) ) ) ) 

I IF  EOUAL(FIRST{REST(RP) )»  FIRST(IP)) 

THEN  (TRUE#  REST(IP)#  OP) 

ELSE  (FALSE*  !P»  OP) 

ENDCASE) 

DEFINE  SKIP(RP#  IP#  OP)  ■ 

IF  E0UAL(FIRST(RP),  "A")  THrN  SK I P { SK IP ( RE  ST ( RP ) » IP#  OP)#  IP#  OP) 
ELSE  IF  EOUAL(FIRST(RP)#  "/")  THEN  SK I P ( SK I P ( REST ( RP ) » I P# OP  ) # I P# OP  ) 
ELSE  REST(REST(RP)  )J 

define  FIND(RP#  IP#  OP)  ■ 

IF  EQUAL(FIRST(RP)#  FIRST(Ip))  THEN  REST ( RP ) 

ELSE  FIN0(SKIP(REST(PP),  IP,  OP)#  IP#  OP); 

ENO 


SEED1#  THE  NEW  GRAMMAR  INTERPRETER*  IN  SEEDGOLM 

FIGURE  8 


-A. 
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The  functionality  of  SFED1  and  its  procedure  Test  are 
as  follows: 

SEED1  : STRINGS  (G)  X STRINGS  (I)  ->  BOOLEANS  X 

STRINGS  (0) 

SEED1  (G,  I)  equals  (TRUE,  0)  if  and  only  if  the 
input  I is  recognized  by  the  object  grammar  G,  in 
which  case  0 is  the  output  string.  Otherwise, 

SEEDl  (G,  I)  equals  (FALSE,  NULL). 

Test  : STRINGS  (G)  x STRINGS  (I)  X STRINGS  (0)  -> 

BOOLEANS  x STRINGS  ( I)  X STRINGS  (0) 
Test(R,  I,  0)  returns  (B,  I',  O'),  where  B is  TRUE 
if  and  only  if  a prefix  of  I is  recognized  by  R, 
in  which  case  I'  is  the  tail  of  I remaining  after 
recognition  by  R,  and  O'  is  the  concatenation  of  0 
with  the  output  characters  encountered  during 
recognition  of  I by  R. 


6.  Implementing  SEEDG0L-1 

SEEDGOL- 1 has  restrictions  similar  to  those  of 
SEEDGOL-O: 

1.  As  in  SEEDGOL-0 , all  blanks  are  ignored, 
and  no  reserved  word  should  be  a prefix 
of  an  identifier  or  another  reserved 
word. 

2.  There  are  no  declarations  other  than 
procedure  declarations.  The  only 
variables  are  the  parameters  RP,  IP,  and 
OP  (strings) , and  the  local  temporary 
variables  Rtemp  (boolean) , Itemp,  and 
Otemp  (strings). 

3.  Procedures  have  exactly  3 parameters. 

Tuple  elements  and  procedure  arguments 
are  evaluated  lef t-to-right . 

4.  Procedures  and  mainline  consist  of  a 
single  expression. 

5.  The  language  has  no  I/O  facility 
whatsoever.  The  user  is  expected  to 
initialize  the  input  and  grammar 
strings,  and  retrieve  the  output  string 
from  its  buffer. 


6.1  Translating  SEEDGOL* 1 to  Object  Code 

The  translation  of  a SEEDGOL-1  program  into  object  code 
for  the  SEEDGOL-1  machine,  HI,  can  be  effected  by  the 
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translation  graaaar  SGL1G,  shown  in  Pigs.  9-A  and  9-B. 
Fig.  10-B  shows  SEEDI-O,  the  result  of  translating  SEED  1 
(Pig.  8)  via  SGL1G.  SEEDI-o,  when  converted  to  binary 
fora,  implements  SEED  1 on  Hi.  Fig.  10-A  gives  a aneaonic 
representation  of  SEEDI-O. 


6.2  HI,  the  SEEDGOL-1  Hachine 

The  SEEDGOL-1  machine.  Hi,  requires  exactly  the  same 
storage  areas  and  registers  as  the  SEEDGOL- 9 machine,  HO 
(Pig.  6),  with  the  exception  that  the  output  buffer  is  no 
longer  optional,  but  required,  because  the  ability  to  reset 
the  output  pointer  is  needed.  The  main  differences  between 
the  two  machines  are  in  the  instruction  set.  Three 
instructions,  OUTPUT,  CALL,  and  RETURN,  have  been  changed 
slightly  in  meaning,  while  PARH  replaces  IP  and  RP  and 
handles  the  parameter  OP  and  the  local  variables  Btemp, 
Itemp,  and  Oteap  as  well.  The  four  operators  CASE,  TEST, 
ENDTST,  and  ENDCAS  implement  both  case  and  if-then-else 
expressions.  POP  is  a new  opcode,  used  twice  by  the 

"BOOLEAN"  function  (see  Fig.  9-A)  The  new  opcodes  are 
summarized  in  Fig.  11.  ALGOL  and  PDP-11  assembly  language 
programs  to  implement  HI  are  shown  in  Appendices  C and  D. 

The  case  expression  operators  are  worthy  of  special 
comment.  Suppose  a SEEDGOL-1  program  contains 

CASE  (expC)  OF 

(expl)  : (expA) 

(exp2)  : (expB) 

(exp3)  : (expC) 

ENDCASE  . 


Then  the  object  program  would  read: 


CASE  (expC*)  (expl*) 

CASTST 

(expA*) 

ENDTST 

(exp2*) 

CASTST 

(expB* ) 

ENDTST 

(exp3*) 

CASTST 

(expC* ) 

ENDTST 

ENDCAS  . 

The  CASE  instruction 

is  a 

no-op 

used  merely  as  a 

bracketing  symbol.  After  expC* 

and  expl 

* are  executed,  two 

values  have  been  pushed  on  E.  CASTST  compares  them,  popping 
both  if  they  are  equal,  and  only  the  top  (the  value  of 
expl*)  if  they  are  not.  Then,  if  they  were  equal,  execution 
continues  at  CASTST  ♦ 1;  otherwise  execution  continues  at 
the  next  matching  ENDTST  ♦ 1.  (Here,  matching  is  determined 
by  counting  TEST'S  and  ENDTST's.)  when  an  ENDTST  is 
executed,  .the  program  counter  is  set  to  the  next  matching 
ENDCAS  (with  matching  determined  by  counting  CASE'S  and 
ENDCAS' s) . 
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MNMMMWKi 


OPCODE 

MNEMONIC 

DESCRIPTION 

( ANY 

CHARACTER) 

(CHAR)  push  the  OPCODE*  a character. 

0 

GP. 

(STRING)  PUSH  POINTER  TC  EIRST  CHARACTER  IN  G. 

1 

PARM 

(ITEM)  PUSH  THE  N-TH  PARAMETER*  WHERE  N 

IS  IN  THE  NEXT  INSTRUCTION  lOC  (ADVANCE  PC). 

2 

EIRST 

(STRING  ->  CHAR)  REPLACE  TOP  OE  E wlTH  THE 

character  TO  which  it  points. 

3 

REST 

(STRING  •>  STRING)  INCREMENT  PTR  AT  TOP  OE  E. 

« 

OUTPUT 

(STRING  * CHAR  ->  STRING)  POP  E*  STORING  THIS 
CHAR  IN  THE  0 LOCATION  GIVEN  BY  TOP(E)»  WHICH 

IS  THEN  incremented. 

S 

POP 

(ITEM  «>  ) POP  E. 

6 

ISNULLI 

CSTRING(I)  ->  900(.)  REPLACE  TOP  OE  E WITH 
TRUE  IE  IT  POINTS  TO  FNO  OE  I*  ELSF  EALSE. 

T 

Equal 

(char  * char  ->  bool)  replace  the  top  a chars 

ON  E kith  TRUE  IE  THEY  ARE  EQUAL*  ELSE  EALSE. 

8 

CASE 

( ) no-operation,  marker  only. 

9 

TEST 

( I T r m * ITEM  ->  Item  OR  NOTHING)  IE  the  TOP  2 
ITEMS  On  E ARE  EQUAL*  POP  THEM  ROTH, 
OTHERWISE.  POp  ONLY  the  TCP.  ANO 
aOVANCE  to  THE  NEXT  MATCHING  ENOTST  ♦ 1. 

10 

enotst 

( ) SKIP  TO  THE  NEXT  MATCHING  ENDCAS  ♦ 1. 

11 

ENOCAS 

( ) no-ope«ation.  marker  only. 

12 

PROCN 

(APCR)  COMPARE  the  CHARACTERS  IN  m at  PC  WITH 
ALL  strings  EOLLOWING  return  INSTRUCTIONS 
anykheRe  in  m.  push  the  location  immediately 
aETER  The  matching  string  (THE  CALLED 
ADDRESS).  AOVANCE  PC  BEyOnO  THE  PROCEOURE 
NAME. 

13 

CALL 

(AOOR  « ITEM  x ITEM  * ITEM  ->  ) PUSH  PC  (RETURN 

ADORE  SS ) ONTO  p.  EOLLOWEC  py  three  zero-s 
(TEMP  STORAGE)  ANO  THE  THREE  ITEMS  POPPEO 
rRO“  E,  POP  THE  AOOR  OFF  E»  ANO  BRANCH  TO  IT 

!• 

return 

( ) IF  P HAS  6 OR  LESS  ITEMS*  SET  THE  HALT 
FLAG.  OTHERWISE*  POP  THE  6 TOP  ITF-S  OFF  P 
(3  TEMP  VARIABLES*  ANO  3 PARAMETERS))  SET 
PC  TO  The  AOOR  POPPED  OFF  P NEXT, 

is 

SAVE 

(ITEM  x I TFM  x ITEM  •>  ITEM  * ITEM  x ITEM) 

COPY  THf  TOP  3 ITEMS  ON  E INTO  THE  3 CURRENT 
TEMP  STORAGE  LOCATIONS  IN  P, 

ill  pushes  an o pops  refer  To  the  e stack*  except  as  noted, 
instructions  “list  pc  made  distinguishable  frc m constants. 

THE  SET  "I TEH"  IS  THf  UNION  OF  CHAR*  STRING*  AND  BOOLEAN. 
PI  MACHINE  INSTRUCTIONS 
FIGURE  11 


34 


If  the  value  of  expO*  does  not  equal  the  value  of  any 
expression  before  a CASTST,  then  control  eventually  drops 
down  to  the  ENOCHS  instruction,  which  is  a no-op.  Since 
none  of  the  CASTST  instructions  popped  the  value  of  expO*, 
the  default  value  of  the  CASE  expression  is  the  value  of  the 
selector  expression  expO*. 

An  IP-THEN-ELSE  expression  is  translated  into  the  code 
for  a case  expression  in  a straightforward  nanner: 

IF  (expO)  THEN  (expl)  ELSE  (exp2) 

becoaes 

CASE  (expO*)  1 CASTST  (expl*)  ENDTST 
0 CASTST  (exp2*)  ENDTST 

ENDCAS  . 

According  to  the  conventions  of  the  graanar  SGL2G  and 
the  boolean- valued  priaitives  of  Hi  (ISNULL  and  EQUAL),  1 is 
the  value  of  TRUE,  and  0 is  equivalent  to  FALSE.  It  should 
be  clear  that  the  case  operators  above  iapleaent  the 
IF-THEN-ELSE  construct. 


6.1  Initial  Bootstrapping  with  HI 

To  get  the  SEED1  prograa  running  on  HI,  the  user  aust: 

1.  Transcribe  til-0  (Fig.  2),  the  object 
graanar  graaaar  into  aachine  readable 
fora. 

2.  Transcribe  SEED  1-0  (Fig.  10-B)  , the 
object  code  for  SEED1  on  Hi. 

3.  Inpleaent  HI,  the  SSEDGOL- 1 aachine 

(Appendices  C and  D are  exaaples) . 

4.  Iapleaent  a aeans  for  getting  prefix 

Polish  "object"  granaars.  Hi 

instructions,  and  input  strings  into  the 
storage  areas  for  HI,  and  a aeans  to 
retrieve  the  output  string  after 

translation. 


7#  Tiaing  Results  for  Different  Translators 

The  following  table  shows  the  tiaing  results  for  the 
translation  of  Gl  (Fig*  1)  to  G1-0  (Fig.  .2)  using  G1-0  as 
the  translation  graaaar. 
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SEEDO,  written  in  B5700  extended  ALGOL 
SEE DO,  on  NO  in  B5700  extended  ALGOL 
SEEDO , on  NO  in  PDP-11  assembly  language 
(•*> 

SBED1,  written  in  B5700  extended  ALGOL 
SBED1 , on  HI  in  B5700  extended  ALGOL 


SEBD1,  on  Hi  in  PDP-11  assenbly  language 


£119 

4 hours  (♦*) 

154  hours  (**) 

6 1/2  hours 

9 1/2  nin.  (25) 

2 hours,  15  nin. 
(68)  (**) 

1 hour,  20  nin.  (♦) 
14  1/4  nin.  (27) 

8 1/2  nin.  (*) 


(*)  — estiaate,  when  sped  up  using  the  PROCN2  acidification 
below. 

(**)  — estiaate,  based  on  partial  runs  and  coaparisons. 


Munbers  in  parentheses  indicate  speedup  factor  oxer 
corresponding  SEEDO  iapleaentation. 


Soae  tiling  tests  were  run  using  a aodified  PROCN 
instruction,  which  would  perfora  a linear  search  of  the 
aachine  code  storage  area  N only  once  to  deteraine  a called 
address.  It  would  then  store  the  called  address  in  a table, 
along  with  the  continuation  address  (the  address  at  which 
execution  continues,  after  the  procedure  naae  following  the 
PROCN  instruction) . PROCN  would  store  an  index  to  the  new 
table  entries  in  the  location  following  the  PROCN 
instruction  (which  was  foraerly  the  first  letter  in  the 
called  procedure's  naae).  Finally,  it  would  change  the 
PROCN  instruction  to  a PROCN2  instruction,  so  that 
subseguent  execution  of  the  instruction  would  aerely 
retrieve  the  called  address  and  the  continuation  address 
froa  the  table.  This  iaproveaent  yields  a 40*  speedup  in 
execution,  but  reguires  care  in  iapleaentation:  any 
aodif ications  to  N oust  not  create  entries  which  nay  be 
construed  as  CASE,  CASTST,  ENDTST,  or  ENDCAS  instructions. 
Because  of  this  "trickiness"  involved,  the  PR0CN2  feature  is 
not  included  in  the  Hi  aachines  described  in  Pig.  11,  or 
Appendices  C and  D.  Iapleaentation  of  this  feature  is  left 
to  the  reader. 


One  unexpected  anoaaly  which  appears  in  Table  1 is  the 
fact  that  the  advantage  of  SEED1  over  SEEDO  is  far  greater 
on  the  B5700  in  extended  ALGOL  then  on  the  PDP-11  in 
asseably  language.  Apparently  SEEDO  requires  extensive  use 
of  those  sections  of  code  which  are  particularly  inefficient 
in  ALGOL. 


8.  Conclusions 


36 


He  have  seen  that  a simple  translator  writing  system 
with  reasonable  storage  and  CPU  time  requirements  can  be 
easily  implemented  without  the  aid  of  a compiler.  The 
allowed  translations  are  those  which  can  be  expressed  via  a 
context-free. grammar,  augmented  with  output  symbols.  Such 
translations  can  include  renaming,  and  conversions  between 
infix,  prefix,  and  postfix  notations.  Address  calculations, 
and  other  effects  which  cannot  be  expressed  in  a 
context-free  grammar,  cannot  be  performed  by  this  siaple 
syntax-driven  method. 


We  observed  by  example  that  the  siaple  translations 
described  above  can  still  be  quite  useful,  since  address 
calculations  and  other  problems  can  be  postponed  until  run 
time,  and  handled  by  an  interpreter.  We  also  observed  that 
an  interpreter  for  a very  restricted  (but  still  useful!) 
language  can  be  quite  simple,  in  spite  of  the  need  to 
calculate  addresses.  Important  simplifying  aspects  of  a 
language  include  such  restrictions  as  a fixed  number  of 
parameters  for  any  user-defined  procedure,  no  user-declared 
variables,  and  a limited  set  of  built-in  functions. 


The  simple  TWS  given  in  this  paper  is  a step  in  the 
evolution  of  a more  sophisticated  TWS.  Hopefully,  this 
simple  TWS  will  facilitate  the  development  of  languages  :.ol 
describing  scanners,  parsers,  and  code  generators,  th*> 
constituents  of  typical  TWS's. 


It  is  a straightforward  process  to 
simple  system  on  other  machines,  including 


implement  this 
microprocessors. 
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APPENDIX  A 

BURROUGHS  B57Q0  EXTENDED  ALGOL  IMPLEMENTATION  OF  Mq 


BEGIN  X MO  IN  X ALGOL 

^declarations,  preliminaries 

DEFINE  TlLsSTEP  1 UNTIL#;  % A TEXT  SUBSTITUTION  MACRO 
ARRAY  MCODEtOt 10223; 

ARRAY  GItOt 10223;  X CHARACTER  STORAGE 
INTEGER  ENOl; 

integer  pc;  real  instruction; 

INTEGER  IPTR,  GPTR ; X FIXED  POINTERS  TO  G AND  I IN  CHAR  STORAGE 

X A STACK  IS  AN  ARRAY  WHOSE  O-TH  ELEMENT  POINTS  TO  THE  TOP 
OEFInE  STACK  = ARRAY#,  TOP(S)  = StStOUAi,  STACKSIZE(S)  s S[Ol#, 

NEXT ( S,  I)  = S r s c 0 3 - (i))#; 

real  procedure  puswcx,  sj;  value  x;  stack  st*n  real  x; 

push  «=  StStO]  * = stol  ♦ n Is  x;  X WARNING*  NESTED  ASSIGNMENT 

real  procedure  popcs);  stack  st*i; 

POP  »■  S( (SfO) » =StOT-l  ) * lj;  X WARNING*  NESTED  ASSIGNMENT 

STACK  P [0*102?];  X PARAMETER  STACK 
STACK  E [0:30])  X EVALUATION  STACK 

ARRAY  0 [0*1022);  X OUTPUT  BUFFER 

X XXSFTINBIT  SETS  THE  INSTRUCTION  BIT  OF  X 
X IS?INST(x)  IS  true  IF  M IS  AN  INSTRUCTION,  false  otherwise 
X X.DPFiELO  is  THE  OPCODE  of  m,  WITH  the  INSTRUCTION  BIT  REMOVED 
DEFINE  SETIN'BIT  s 1 t 7 1 0 : 1 ) # , InrIT  * [7*1]#,  INBITEIELD  = [7*0*1)#; 
DEFINE  RET7INST  = 1 34SETINRIT#, 

IF7INST  * PSSETTvRIT#,  THEN?INST  = 9&SETIN0IT#, 

ELSE7INST  * 1 C8SET I NR  I T« ) 

* THE  FOLLOWING  muST  CORRESPOND  WITH 
X CONSTANTS  GENERATED  RY  THE  GRAMMAR, 

DEFINE  FALSEVAL  = 0#,  TRUEVAL  = \*>  BLANK  = " ”ti 

OEFINE  IS?INST(m)  = C C M ) . INBIT  = 1)#*  OPFIELO  = t6*7)#; 
boolean  halt; 
integer  cycles,  tracecnt; 

PROCEDURE  INITIALIZE* 

8EGIN 

oroi  *=  p[0)  * = Etc)  *=  cycles  *=  tracecnt  i=  o; 

END  INITIALIZE; 

PROCEDURE  GETInput; 

I X USER-PROVIDEC  ROUTINE  to  LOAO  GI  (G  8 I)  AND  MCOOE 

X THE  main  ROUTINES  EXPECT  1 CHARACTER  or  OPCODE  PER  WORD 

PROCEDURE  summarize; 

I X IF  Etn  CONTAINS  "TRUEVAL"  THEN  THE  OUTPUT 

x string  is  contained  in  o c i 3 through  oistacksizeio)]* 

X ONE  CHARACTER  PER  WORD. 


L 


* complicated  instructions 

INTEGER  REGl#  RE&2J 

PROCEDURE  SKIPPAST(INST)) 

VALUE  INST)  REAL  INST) 

BEGIN 
REGl  «•  0) 

WHILE  REGl  GEQ  0 DO 
BEGIN 

IF  MCOOEtPCI  s INST  THEN  REGl  »«  REGl  - 1 

ELSE  IF  MC0DEIPC1  = IF7INST  THEN  REGl  »=  REGl  ♦ II 

PC  »■  PC  ♦ 1) 

ENO) 

END  SKIP  PAST) 

PROCEDURE  CALL) 

BFGIN 

PUSH ( PC  # P)l  t PUSH  RETURN  ADDRESS  ON  PARAMETER  STACK, 

Push ( pop ( e ) » p>)  x "IP"#  or  2nd  parm 

PUSH(POP(E>#  p>)  X «RP",  OR  1ST  PARM 

Pc  l»  POP ( E ) ) X PROCNAME  INSTR  FIGURED  OUT  EXACT  MACHINE  ADDRESS 
END  CALL) 

procedure  RETURN) 

IF  STACKSIZF(P)  LEG  ? THEN  HALT  ts  TRUE 
ELSE 

BEGIN  POPCP))  POP(P)) 

PC  1 = POP(P)) 

ENO) 


PROCEDURE  PROCNAMf) 

BEGIN  REG2  )e  1)  % 1 = START  OF  CODE 

DO 

BEGIN  REGl  «=  PC)  X START  OF  PROCEDURE  NAME 
WHILE  MC0DECREG21  NEQ  RET7INST  DO 
REG2  I*  REG2  + 1) 

REG2  :»  REG2  ♦ 1) 

WHILE  MCODEtREGl]  NED  BLANK  AND  MCODEtREGlJ  a MCODE  t REG2  3 DO 
BEGIN  REG 2 1=  REG2  ♦ 1) 

REGl  ta  REGl  ♦ l) 

END 

END 

UNTIL  MC0DECREG1)  = MC00ECRFG2J)  l BOTH  ARE  " " 

PUSHCREG2  ♦ 1*  E ) ) 

PC  *■  REGl  ♦ 1) 

eno  procname; 


^ 
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X MAIN  EXECUTION  LOOP  Of  INTERPRETER  * * * 

OEFlNE  FETCH  ■ INSTRUCTION  l»  MCOOEC  CPC  )*PC*1)-1  Jf  ) 


PROCEOURE  EXECUTE) 

IF  NOT  JS?INST( INSTRUCTION)  THEN 

PUSH(MCOOE(PC-l],  E ) X CURRENT  INSTRUCTION  IS  REALLY  A 

x character;  push  it  (char; 

ELSE 

case  instruction. oPFiELD  of 
begin 

x zero-ary  functions 

PUSH(GPTR,  E>) 

PUSH(TOP(P)»  E)) 

PUSH(NEXT(P#1)»"  ED 
X UNARY  FUNCTIONS 
PUSH(GIlPOP(En»  ED 
PUSH  (POP  ( E ) ♦ 1,  ED 
PUSH'POPCED  OD 


push c i f pop(E)  neo  end 


x binary  function 
IF  POP(E)=POP(E)  then  push<  trueval#  E>  Else  PUSH<FALSEvAL,  E)) 

X 7 , "ECUAL"  (CHAR  * CHAR  ->  BOOL) 


X CONTROL  INSTRUCTIONS 
IF  POP(E)=FALSfVAL  THpN  SK I PP A ST ( THEN? I N$T D 


X 

Bp 

"IE" 

SKIPPAST(ELSE?INST)) 

X 

9 p 

"THEN" 

I 

X 

10, 

"ELSE" 

PROCNAME) 

X 

11, 

"PROCNAME 

CALL) 

X 

12, 

"CALL" 

RETURN 

X 

13, 

"RETURN" 

End; 


X 0,  HGP"  PTR  TO  1ST  OF  G ( PTR ) 

x i,  "rp"  refers  to  p-stack 

X LOCATION  POINTING  TO  G (PTR) 

X 2»  "IP",  P-STACK  PTR  TO  I (PTR) 

X 3#  "F I RST"  (CHAR) 

X 4 , "REST-  (PTR  ->  PTR) 

X 5#  "OUTPUT"  (CHAR  ->  ). 

* LEAVES  NO  VALUE  ON  STACK,  OUTPUT 
X CHAR  CANNOT  BE  RETRIEVEO 

then  falseval  ELSE  trueval#  ED 

X 6#  "ISNULLI"  (PTR(I)  ->  BOOLE  AN ) 


X MAINLINE  ************** 

INITIALIZE) 

GETINPUT) 

pushmptR#  P)i  x pointer  to  1st  element  or  r ("iph  parameter) 

PUSH(GPTR»  PD  X POINTFR  to  1ST  ELEMENT  OF  G ("RP"  PARAMETER) 
PC  la  I) 

HALT  I*  FALSE) 

00 

BEGIN 

CYCLES  1=  CYCLES  * 1) 

FETCH) 

EXECUTE) 

END 

UNTIL  HALT) 
summarize; 

END. 


. 
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APPENDIX  fl 

PDP-11  ASSEMBLY  LANGUAGE  IMPLEMENTATION  OF  MO 


.TITLE  MO 

J MO,  IN  PDP-11  ASSEMBLY  LANGUAGE 


FILLIN  * 0 
EPC  » XO 
REGi  s XI 
REG2  * X2 
E » X3 
P ■ XA 
PTRO  ■ X5 
SP  ■ X6 
PC  * X7 
RA  « XA 
R5  * X5 


/.VALUE  TO  BE  FILLEO  IN  AT  RUN  TIME 
I EMULATOR  PROGRAM  COUNTER 
TEMPORARY  RFG  & INSTRUCTION  BUFFER 
/temporary 

/EVALUATION  STACK  POINTFR 
/PARAMETER  STACK  POINTER 

/ PTR.  TO  1ST  POSITION  AFTER  ENO  OF  OUTPUT 
/POP-11  STACK  pointer 
/POP-11  PROGRAM  COUNTER 


false  ■ 0 
TRUE  • 1 

IFX  * 200*8.  /SOME  EMULATOR  OPCOOES 

THEnX  > 200*9 • /BIT  7 IS  SET  SO  OPCODES  AND  CHARACTERS 
ELSEX  ■ 200*10.  /CANNOT  BE  COnFUSEO. 

RETX  * 200*13. 

BLANK  * " " /PROCECuRE  NAME  TERMINATOR 


ESTACKI  .8LKW  AO.  / EXECUTION  STACK 

estak,  t 

PSTACKI  .BlKW  1500.  / PARAMETER  STACK.  INCL  RETURN  AODRS 

p ST  AK  I 

STOPl*  .WORD  FALSE  /F OR  TERMINATING  EXECUTION  LOOP 


CYCLES! 

I! 

EDI ! 

Gl 

0» 

HI 

LENT  * 
LENOI 
PARSED! 


.CSECT  COmI  / FORTRAN  NAMED  COMMON  AREA 

.WORD  0.0  /DPL  PREC  COUNT  OF  EMULATOR  CYCLES 

.Blkb  looo,  /input  string 

.8LKB  1000,  /grammar  (OBJECT  VERSION)  STRING 

.blkb  2000,  /output  string 

.blkb  looo.  /emulator  code 

.WORD  FlLLiNI  NO.  OF  CHARS  ACTUALLY  IN  I 

.WORD  FlLLIM  NO,  OF  CHARS  IN  0 

.WORD  FlLLIM  BOOLEAN  VALUE.  TRUE  IF  SUCCESSFUL  PARSE 


.CSECT  IFACE  /INTERFACE  TO  FORTRAN  CODE  (SAVE  RA,  R5> 
IFACFI  MOV  RA,  SAVERA  /CALLING  SEOUENCEJ  CALL  IF  ACE 

mov  R5,  savers 

JSR  PC.  MO  /EXECUTE  The  EMULATOR 
MOV  SAVERS.  R5 
MOV  SAVERA.  RA 

RTS  R5  /RETURN  TO  005  FORTRAN 

SAVERA!  .WORO  FlLLIN 
SAVERS!  .WORO  FILLIN' 


•C3ECT  JTHE  EMULATOR  FOR  SEEDGOL-1 

I initialization  OF  emulator 

MO l MOV  #M,  EPC  I SET  PROG  COUNTER  TO  START  OF  CODE 

MOV  fESTAK,#  C IE-STACK  POINTER 

MOV  #PSTAK.,  P IP-STACK  POINTER 

MOV  fO#PTRO  INEXT  OUTPUT  SPACE  IS  THE  FIRST 

MOV  #1.  -(P)  IPUSH(PTR(I>»  P)  ("IP"  PARAMETER) 

MOV  # G»  -(P)  IPUSH(PTR(G)*  P)  ("RP"  PARAMETER) 

I FETCH-EXECUTE  LOOP 
LOOPi  INC  CYCLES 

CMP  cycles, #10000, 

BLT  LI 

INC  CYCLES+2 
CLR  CYCLES 

LI*  MOVB  (EPC)*,  REGl  IFETCH  INSTRUCTION,  ADVANCE  EPC 

BIC  #177400,  REGl  INSTRUCTION  Is  ONLY  ONE  BYTE 
BIT  #200*  REGl  IOPcODE  IF  BIT  7 IS  ON 
BNE  CASE 

MOV  REGl#-(E)  IOTHERWISE,  A LITERAL  CALL 

br  loop  i ( push  i word  with  o-s  in  high  byte) 

CASCl  BIC  #200*  REGl  IMASK  OFF  INSTRUCTION  BIT 
ASL  REGl  IFOR  WORD  ADDRESSING 

JSR  PC*  PnPS(REGi)  i branch  to  an  instruction  ROUTINE 
TST  STOP  ICHECK  HALTING  FLAG 

BEO  LOOP 

mov  (E),  parseo  itop  of  e Indicates  success  if  true 
MOV  PTRO*  LENO 
SUB  #0*  LENO 
RTS  PC 

• 

1 ADDRESSES  OF  THE  EMULATION  PROCEDURES  FOR  THE  OPCODES 
OPSi  ,W ORO  GP*  RP*  IP*  FIRST 

• WflRO  REST*  OUTPUT,  ISNULL*  EQUAL*  IF*  THEN 
•MORO  ELSE*  PROCN*  CALL*  RETURN 


I THE  EMULATION  PROCEDURES  FOR  THE  OPCODES 
GP*  MOV  #G*  -(E)  IPUSH(PTR(G)*  E) 

RTS  PC 

RP*  MOV  (P)*  -(E)  * Pij$H(  TOP(  P ) » E)  ("RP"  PARAMETER) 

RTS  PC 

IP*  MOV  2(P)»  -(E)  IPUSM(NEXT(P*1)»  E)  ("IP"  PARAMETER) 

RTS  PC 

FIRST!  MOVB  P(E)*  (E)  IPUSH(CONTENTS(P0P<E ) )*  E) 

BIC  #177400,  (E)  ITOP  BYTE  IS  ALL  O-S,  NON 
RTS  PC 

REST!  INC  (E)  ITOP(E)  »•  TOP(E)  ♦ l 

RTS  PC 

OUTPUT!  MOV  (E)**  REGl  |PUSH(POP(E  )*  0) 

MOvB  RFGl*  (PTRp)*  IwORD  TO  BYTE  CONVERSION 
RTS  PC 

ISNULL*  SUB  LENI*  (E)  IPUSH(POP(E)  ■ ENDOF(I),  E) 

SUB  #1*  (E) 

BEO  ISNUL2 

CLR  (E)  INEQ  ■>  FALSE 

RTS  PC 

ISNUL2*  MOV  #TRUE*  (E) 
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RTS 

PC 

EOUAll 

CMP 

(E >♦#  (E) 

BEO 

EQUAL2 

CLR 

(E) 

RTS 

PC 

E0UAL2I 

MOV 

#TRuE#(.E) 

RTS 

PC 

in 

TST 

<E>4 

BNE 

IF2 

MOV 

ATHENX#  REG2. 

JSR 

PC#  SKIPTO 

IF2l 

RTS 

PC 

THEN! 

MOV 

XELSEX#  REG2 

ELSE* 

JSR 

PC#  SKIPTO 

RTS 

PC 

PROCNI 

MOV 

«M»  REG2 

JPUSH(POP(E)«POP(E># 

12-byte  comparison  I 

JMUST  BE  EXPANOEO  TO 


E) 

single  byte  ITEMS 

WORDS  CONSISTENTLY 


ISKIP  TO  MATCHING 
JPQP(E)  IS  FALSE# 


•’THEN"  IF 
ELSE  00  NOTHING 


;SKIP  TO  MATCHING  "ELSE' 


PROCNlI 


PR0CN2I 


CMPB  (Re62)4, 
BNE  PR0CN1 
MOV  EPC » RFG1 
CMPB  ( REG  1 ) ♦ # 
BNE  PR0CN1 
CMPB  IRlANK#- 


)REG2 
ARETX 


WILL  POINT  TO  CHARS  AFTER  RETX 


BNE 

PR0CN2 

MOV 

REG2# 

-(E) 

MOV 

REGI# 

EPC 

RTS 

PC 

CALL* 

MOV 

EPC,  - 

(P) 

MOV 

(EW# 

-(P) 

MOV 

(E)4# 

-(P) 

MOV 

(E  )*» 

EPC 

RTS 

PC 

RETURNI 

CMP 

(P)  + # 

( P ) 4 

CMP 

P#  fPSTAK, 

BNE 

RET2 

MOV 

#TRUE# 

STOP 

RTS 

PC 

RET2I 

MOV 

( P ) 4 # 

EPC 

RTS 

PC 

SWlPTOi 

clr 

REGI 

SKIP2I 

CMPB  ( F PC ) 

# REG; 

BEO 

SKIP1 

CMPB  (EPC) 

4#  f II 

BNE 

SKIP2 

INC 

REGI 

BR  : 

SKIP2 

SKIP)  l 

INC 

EPC 

OEC 

REGI 

8GE 

SKIP2 

RTS 

PC 

jregi  will  POINT  to  chars  after  procn 

(REG2  )♦ 

1 F I NO  NEXT  "RETURN"  IF  MISMATCH 
1(REG2) 

^CONTINUE  UNLESS  BOTH  ARE  BLANK 
]PUSH(CALLING  AODR#  E) 

IEPC  l»  LOC  AFTER  NAME 

JPUSH(EPC»  P)  (RETURN  AOOR) 

* PUSH( POP ( E ) # P)  (R-PARAMETER)  . 

;PUSH(POP(E>#  p)  U-PARAMETER) 

;EPC  i«  POP(E)  (CALLING  AODR) 

;POP  2 WORDS  OFF  PSTACK 

;halt  if  p-stack  empty 


;EPC  I * RETURN  ADDRESS  («POP(P>> 


JTHTS  ROUTINE  USED  BY  "IF"  AND  "THEN" 
JAOVANCE  EPC#  INCREMENTING  ReGI  ON  "IF". 
)0ECREMENTING  REGI  ON  SOUGHT  INSTRUCTION 
MCONTAINEO  IN  REG2  ) • 

JOUIT  WHEN  REGI  IS  LESS  THAN  0 


.END 
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APPENDIX  C 

BURROUGHS  B5700  EXtENOEO  ALGOL  IMPLEMENTATION  OF  Ml 


BEGIN  X Ml  IN  XALGOL 

xdeclarations*  preliminaries 

OEFjNE  TIL«STEP  I UNTIL#)  X A TEXT  SUBSTITUTION  MACRO 
ARRAY  MC0DE(0»10223)  X MACHINE  CODE 

ARRAY  GI(0*10223)  X CHARACTER  STORAGE 

INTEGER  ENOI) 

integer  po  real  instruction) 

INTEGER  IPTR#  GPTR)  X >IXEO  POINTERS  TO  G AND  I IN  CHaR  STORAGE 

X A STACK  IS  AN  ARRAY  WHOSE  O-TH  CLEMENT  POINTS  TO  THE  TOP 
DEFINE  STACK  » ARRAY#*  TOP(S)  ■ scscon#.  STACKSIZE<S)  * StOJ#* 

NEXT ( S#  I)  » S[S[01  - (1)3#) 

REAL  PROCEDURE  PUSH(X*  S3)  VALUE  Xi  STACK  S(«3)  REAL  X) 

push  I*  srsroi  * = sin)  ♦ n »■  x)  x warning*  nested  assignment 

REAL  PROCEDURE  POP  (S3)  STACK  S(*l) 

pop  i*  st(stoi *=sioi-i ) ♦ n;  x warning*  nested  assignment 

STACK  P 10*102231  X PARAMETER  STACK 
STACK  E 10*303)  X EVALUATION  STACK 

array  o co* 10223)  x output  buffer 

X XlSET INBIT  SETS  THE  INSTRUCTION  BIT  OF  X 

X I S?I N$T( X ) IS  TRUE  IF  M IS  AN  INSTRUCTION*  FALSE  OTHERWISE 
X X.OPFIELO  IS  THE  OPCODE  OF  M,  WITH  THE  INSTRUCTION  BIT  REMOVED 
DEFINE  SETINBIT  » 1(7*0*13#*  INBIT  » (7*13#*  IN BITFIELD  ■ (7*0*13#) 
DEFINE  RET7INST  « 1 4 *Sf T INRI T#,  C ASE7I NST  a B4SETIN8IT#* 

TEST7I NST  » 9ASFTINBIT#,  ENDT  ST?  I NST  a 1 0 &SET I NB I T# * 

ENDCAS7INST  a 1USETINBIT#) 

* THE  FOLLOWING  MUST  CORRESPOND  WITH 

x constants  generated  by  the  grammar, 
oefine  falseval  ■ 0#*  trueval  * 1#*  blank  a * «#) 

DEFINE  I S? I NST( W ) a ((M). INBIT  « 1)#»  OPF IELD  » (6*73#) 

BOOLEAN  HALT) 

integer  cycles*  tracecnt) 

PROCEDURE  INITIALIZE) 

BEGIN 

0(03  *»  P( 03  * a E I Cl  *=  CYCLES  »*  TRACECNT  l»  0) 

ENO  INITIALIZE) 

PROCEDURE  GETINPUT) 

I X USER-PROVIDED  ROUTINE  TO  LO AO  G I < G & I)  AND  MCODE 

X THE  main  ROUTINES  EXPECT  1 CHARACTER  OR  OPCODE  PER  WORD 

PROCEDURE  SUMMARIZE) 

I X IF  E( 1 1 CONTAINS  "TRUEVAL"  THEN  THE  OUTPUT 
X STRING  IS  CONTAINED  IN  0(13  THROUGH  0(E(21J, 

X ONE  CHARACTER  PER  WORD. 
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* COMPLICATED  INSTRUCTIONS 

INTEGER  REGl#  ReG2! 

PROCEDURE  SKIPPAST(  INSTl#  INST?)! 

VALUE  INSTl#  INST?!  REAL  INSTl#  INST2! 

BEGIN 
REG1  !•  Of 
WHILE  REG1  GEO  0 DO 
BEGIN 

IF  HCODEIPC)  • INSTl  THEN  REG1  »•  REG1  ♦ 1 
ELSE  IF  MCODEIPCJ  « INST2  THEN  REG1  It  REG  1 - 1! 

PC  I*  PC  ♦ II 
END! 

END! 

PROCEDURE  CALL! 

BEGIN 

PUSH(PC#  P)l  » PUSH  RETURN  ADDRESS  ON  PARAMETER  STACK, 

THRU  3 DO  PUSH(0#  P>! 

THRU  3 DO  PUSH(POPIE)#  P)!  * OP#  IP#  AND  RP  PARAMETERS 

PC  *•  POP(E)!  * PRpCNAME  INSTR  FIGUREO  OUT  Exact  MACHINE  aooress 

END  CALL! 

PROCEDURE  RETURN'! 

IF  STACKSIZE(P)  LEO  6 THEN  HALT  »«  TRUE 
ELSE 

BEGIN  thru  6 CO  POP ( P ) ! 

PC  »■  POP ( P ) ! 

END! 

PROCEDURE  PROCNAMf! 

BEGIN  REG2  t*  1!  * 1 * START  OF  CODE 

DO 

BEGIN  REG  1 l«  PC!  t START  OF  PROCEDURE  NAME 
WHILE  MCODEtREGZJ  NE 3 RET7INST  DO 
REG2  tn  REG2  ♦ 1! 

REG2  «•  REG2  ♦ \f 

WHILE  MCnoEtREGll  NE 0 BLANK  AND  MCODE t Rf.6 1 3 • MC0DEIREG2]  DC 
BEGIN  REG2  »«  REG2  ♦ l! 

RE61  l a REG  1 ♦ 1! 

ENO 

ENO 

UNTIL  MCODEtPEGlJ  « MCODE  t REG2 ) ! * BOTH  ARE  BLANK 

PUSH(REG2  ♦ 1#  E > ! 

PC  l«  REGJ  ♦ li 
ENO  PROCNAMEi 
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C-3 


X MAIN  EXECUTION  LOOP  OF  INTERPRETER  * * • 

DEFINE  FETCH  ■ INSTRUCTION  !■  McODE t (PC » "PCM >-l 3 l! 


PROCFOURE  EXECUTE! 

BEGIN  INTEGER  I ! 

Ir  NOT  IS?INST( INSTRUCTION)  THEN 

PUSH ( INSTRUCT  I ON#  E)  X CURRENT  instruction  is  really  a 

X CHARACTER}  PUSH  IT  (CHAR) 

ELSE 

CASE  INSTRUCTION. OPFIELO  OF 
BEGIN 

X ZeRO-ARY  FUNCTIONS 

PUSH(GPTR#  E)l  * 0#  "GP".  PTR  TO  1ST  CHAR  OF  G 

X AND  TO  1 PAST  ENO  OF  I (PTR) 
PUSH(NEXT(P#  MCODEt (PCI»PC*1  )-l)  )#  E)! 

X 1#  "PARM"  GET  N-TH  PARM#  WHERE 
X N IS  IN  NEXT  INST  LOC 


X UNARY  FUNCTIONS 
PUSH(GI(P0P(E)1#  E)! 

X 

2# 

"FIRST"  (CHAR) 

PUSH ( POP ( E ) ♦ 1#  F)! 

X 

3# 

"REST"  (PTR  •> 

PTR) 

OtNEXT(E#l  )I«NEXT(E#1  )*1) 

1* 

POP(E)! 

X 

A # 

"OUTPUT"  (PTR 

x Char 

POP(E)l 

X 

5# 

"POP"  (ITEM  -> 

) 

pushhf  pop(F)«enoi  then  trufval  else  falseval#  e)) 

X 6#  "ISNULLl" 

X BINARY  FUNCTION 

IF  POP(E)«PCP(F)  THEN  PUSH(TRUEVAL#  E)  ELSE  PUSH(FALSEVAL#  E)! 

X 7#  "EOUAL"  (CHAR  x CHAR  ->  BOOL) 

X CONTROL  INSTRUCTIONS 
I X R#  "CASE" 

IF  POP(E)  NEC  TOP(E)  THEN  $K!PPASTCTFST?INST#  EN0TST7INST) 

ELSE  POP(E)i 

X 9#  "TEST" 

SKIPPAST(CASE?INST#  EmOCASTInST)} 

X 10#  "ENCTST" 

} X 11#  "ENCCAS" 

PROCNAmE)  X 12#  "PROCNAmE" 

CALL!  * 13#  "CALL" 

RETURN!  X 1A#  "RETURN" 

FOR  I I*  0 TIL  2 00  NFXT(P#  3M)  l»  NEXT(E#  I)! 

X IS#  "SAVE" 

ENO! 

ENO  EXECUTE! 


APPENDIX  0 

PDP-11  ASSEMBLY  LANGUAGE  IMPLEMENTATION  Or  Ml 


•TITLE  Mi 

I Ml,  IN  POP-11  ASSEMBLY  LANGUAGE 


PILLIN  • 0 
CPC  • «0 
REGl  • tl 
REGS  ■ X2 
E ■ 13 
P ■ X4 
PTRO  * *5 
SP  ■ t« 

PC  ■ t7 
R4  • 14 
R5  ■ t5 


1 VALUE  TO  BE  ElLLEO  IN  AT  RUN  TIME 
/EMULATOR  PROGRAM  COUNTER 
/TEMPORARY  REG  & INSTRUCTION  BUFFER 
/TEMPORARY 

/EVALUATION  STACK  POINTER 

/PARAMETER  STACK  POINTER 

/PTR,  TO  1ST  POSITION  AFTER  END  OF  OUTPUT 

/POP-l'l  STACK  POINTER 

/POP-11  PROGRAM  COUNTER 


FALSE  ■ 0 
TRUE  ■ 1 

TCSTX  ■ 200*9.  /SOME  EMULATOR  0 

ENOTSX  « 200*10.  /BIT  7 IS  SET  SO 

CASEX  • 200*8,  /CANNOT  BE  CQnFO 

ENDCAX  » 200*11. 

RETX  ■ 200*14. 

BLANK  « " » /PROCEDURE  NAME  TERMINATOR 


/SOME  EMULATOR  OPCODFS 

/BIT  7 IS  SET  SO  OPCODES  AND  CHARACTERS 
/CANNOT  BE  CONFUSED 


ESTACKI  «BLKW  40*  / EXECUTION  STACK 

ESTAK. I 

PSTACKI  .BLKW  3000.  / PARAMETER  STACK,  JNCL  RETURN  ADORS 

PSTak.i 

STOPi  .WORD  FALSE  /FOR  TERMINATING  EXECUTION  LOOP 


CYCLES! 

II 

coil 

Gl 

01 

Ml 

LENIt 

LCNOl 

PARSFDl 


IfACEl 


•CSECT  C0M1  / FORTRAN  NAMED  COMMON  AREA 


SAVERAI 
SAVERS  I 


• WORD 
.BLKB 

.BLKB 

.BLKB 

• BLKB 

• NORO 

• MORD 

• WORD 


0*0  /OfiL  PREC  COUNT  OE  EMULATOR  CYCLES 
1000,  /INPUT  STRING 

1000,  /GRAMMAR  (OBJECT  VERSION)  STRING 

2000.  /OUTPUT  STRING 

1000,  /EMULATOR  CODE 

FlLLIN/  NO.  OF  CHARS  ACTUALLY  IN  I 

FlLLIN/  NO.  OF  CHARS  IN  0 

FIllINZ  BOOLEAN  value.  TRUE  IF  SUCCESSFUL  PARSE 


.csect  iface  /interface  to  fortran  cqce  (save  ra,  rs) 

MOV  R 4 # SAVER4  /CALLING  SEOuENCEl  CALL  IFACE 
MOV  R5#  SAVERS 

JSR  PC*  MO  /EXFCUTE  The  EMULATOR 
MOV  SAVERS*  R5 
MOV  SAVER4#  R4 

RTS  R5  /RETURN  TO  DOS  FORTRAN 

.MORO  FlLLIN 
•WORD  FlLLIN 


•CSECT  / THf  EMULATOR  FOR  SEEOOOL-1 

/ INITIALIZATION  OF  emulator 

MO  I MOV  #M#  EPC  I SET  PROS  COUNTER  TO  START  OF  CODE 

MOV  iESTAK.#  e /e-stack  pointer 

MOV  #PSYAK.»6#  P JP-STACK  POINTER  WITH  3 TEMP  LOCATIONS 
MOV  #0#«'P)  IPUSH(PTR(0)*  P)  ("OP"  PARAMETER) 

MOV  #1#  *<P)  IPUSH(PTR(D*  P>  ("IP”  PARAMETER) 

MOV  #G»  “(P)  /PUSH(PTRCG)#  P)  ( "RP"  PARAMETER) 

I FETCH-EXICUTE  LOOP 
LOOP!  INC  CYCLES 

CMP  cycles##ioooo. 

0LT  LI 

INC  CYCLES*2 
clr  cycles 

LI 1 MOVB  (EPC)*#  RE61 
BIC  f 1 77A00#  RE61 
BIT  #200#  RE61 
BNE  CASES 
MOV  RESl#-(E) 

BR  LOOP 

CASES!  BIC  #200*  RES  1 
ASL  REGl 

JSR  PC#  POPS(REfil) 

TST  STOP 
BEO  LOOP 

MOV  2(E)#  PARSEO 
MOV  (E)#  LEND  J TOP  OF  E POINTS  TO  NEXT  AVAIL  SPACE  IN  0 
SUB  #0#  LENO 
RTS  PC 

/ ADDRESSES  OF  THE  EMULATION  PROCEDURES  FOR  THE  OPCODES 
OPSI  .NORO  GP#  PARM,  FIRST#  REST#  OUTPUT 
.WORD  POP#  ISNULL#  EOuAL#  CASE#  TEST 
.WORO  ENDTST#  ENDCAS#  PROCN#  CALL#  RFTuRN 
.WORD  SAVE 


/ THE  EMULATION  PROCEDURES  FOR  THE  OPCODES 


GP  1 

MOV  #G*  -(E) 
RTS  PC 

/PUSH(PTR(G)#  E) 

PARmi 

MOVB  (EPC)*#REGl 

/GET  P ARM  AT  N-TH  FROM 

TOP  OF  P# 

ASL  REGl 

/(DOUBLE  OFFSFT  FOR  WORD 

AOORESSING) 

ADO  P»REGl 
MOV  (REGl )#»(E) 
RTS  PC 

/N  IS  IN  NEXT  CODE  LOCATION 
/PUSH  result  ON  e 

FIRST! 

MOVB  P(E)#  (E) 

IPUSH(CONTEnTS(POP(E))*  E) 

BIC  #177A00»  (E) 
RTS  PC 

/TOP  BYTE  IS  ALL  0"S» 

NOW 

REST! 

INC  (E) 
RTS  PC 

ITOP(E)  1*  TOP(E)  ♦ l 

OUTPUT! 

MOV  (E)*»REG1 
MOVB  REOl#P(E) 
INC  (E) 

RTS  PC  ' 

/0(NEXT(E#l)I  * * POP(f) 
/TOP(E)  «•  T0P(E)*1 

POP! 

CMP  (E )♦» (E ) 
RTS  PC 

/POP(E) 

ISNULL! 

SUB  LENI#  (E) 

/PUSH(POP(E)  ■ ENDOF(I)#  E) 

/fetch  instruction#  aovance  epc 
/instruction  IS  only  one  byte 
/OPCODE  IF  BIT  7 IS  ON 

/OTHERWISE#  A LITERAL  CALL 
/(PUSH  1 WORO  WITH  0"S  IN  HIGH  BYTE) 
/MASK  OFF  INSTRUCTION  BIT 
/FOR  WORD  AOORESSING 
/BRANCH  TO  AN  INSTRUCTION  ROUTINE 
/CHECK  HALTING  FLAG 

/NEXT  OF  E INDICATES  SUCCESS  IF  TRUE 


rinunrr-'  i 1T1 


■ 
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SUB 

#1#  (E) 

BEQ 

ISNUL2 

CIR 

(E) 

JNEO  ■>  FALSE 

RTS 

PC 

• 

ISNUL2I 

MOV 

ATRUE#  ( E ) 

RTS 

PC 

EQUAL  1 

CMP 

(E  )♦»  (E) 

IPUSH(POP(E)sPOP(E)»  e) 

BEQ 

EQUAL2 

J2-BYTE  COMPARISON  i SINGLE  byte  items 

CLR 

<E> 

l MUST  BE  EXPANOED  TO  WOROS  CONSISTENTLY 

RTS 

PC 

EQUAL2I 

MOV 

*TRUE» (E ) 

CASEl 

RTS 

PC 

;no-op 

TEST  1 

CMP 

(E  )♦. (E ) 

;IF  POP ( E > NEQ  TOP ( E ) THEN  SKIP  PAST 

BEQ 

TEST2 

jnext  matching  endtst  INSTRUCTION 

MOV 

«TESTX#INSTl 

1 ELSE  POP(E) 

MOV 

fENDTSX, INST2 

JSR 

PC # SK I PTO 

RTS 

PC 

TEST2I 

CMP 

(E  )♦# (E) 

RTS 

PC 

ENOTSTl 

MOV 

fCASEX»!NSTl 

jSKlP  PAST  MACHING  ENOCAS  INSTR 

MOV 

f ENOC  AX  # I N ST2 

JSR 

PC.  SKIPTO 

enDcasi 

RTS 

PC 

) A NO-OP 

procni 

MOV 

#M.  REG2 

IREG2  will  point  to  chars  after  retx 

PROCN1 I 

CMPB  (RFG2)+,  ARETX 

BNE 

PROCNI 

MOV 

epc,  Reg i 

;regi  will  point  to  chars  after  procn 

PR0CN2I 


CMPB  ( RFG 1 } ♦ # 
BNE  PROCNI 
CMPB  IBLAK’K#- 


BNE 

PR0CN2 

MOV 

REG2»-(E ) 

MOV 

REG  t » EPC 

RTS 

PC 

CALLI 

MOV 

EPC.-(P) 

SUR 

• 6 » P 

MOV 

(E  >♦»-(?  ) 

MOV 

(EW»-(P) 

MOV 

(E>*.-(P) 

MOV 

(EW.EPC 

RTS 

PC 

RETuRNi 

Aon 

*12. *P 

CMP 

P.  iPSTAK. 

BNE 

RET2 

MOV 

*true»  stop 

RTS 

PC 

RET2I 

MOV 

<P>*.  EPC 

RTS 

PC 

SAVE! 

MOV 

(E),6(P) 

MOV 

2(E). 8. (P) 

MOV 

*(E)#10»(P) 

RTS 

PC 

(REG2  )♦ 

J F I ND  NEXT  "RETURN"  IF  MISMATCH 
1 (REG2  } 

iCONTlNUE  UNLESS  BOTH  ARE  BLANK 

;push(called  addr,  e) 
jepc  » = LOC  AFTER  name 

)PUSH(FPC»  P>  (RETURN  AqOR> 

J PUSH  3 TEMP  LOCATIONS 
JPUSH(POP(E)»  P)  (OP-PARAMETER) 
) PUSH(PQP(E ) » P)  (R-PARAMETER) 

iPUSH(POP(E)»  P)  ( I-PARAMETER) 

I EPC  ts  POP(E)  (CALLING  ADOR) 

IPOP  6 WORDS  OFF  PSTACK 

jhalt  ie  p-stack  empty 


;EPC  l * RETURN  ADDRESS  (*POP(P)) 


) STORE  T«E  TOP  3 THINGS  ON  E IN  THE 
I TEMP  locations  IN  the  p-stack 
jooes  vQT  affect  e 


-I..-  ...  . fcl— 


Mia  . 


I NST 1 * 
1NST2I 
SKlPTOl 
SKlP?l 


SKIP!! 
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.WORO  I PARAMETERS  FOR  SKlPTQ  SUBROUTINE 


*WORO 
CLR  REGl 

CMPB  (EPC  > . INST2 
BEO  SK1P1 

CMPB  <EPC)*»  INST1 

BNE  SKIP2 

INC  REGl 

BR  SKIP2 

INC  EPC 

DEC  REGl 

BGE  SKIP2 

RTS  PC 

• END 


ITHIS  ROUTINE  USED  BY  "TEST"  ANO  "ENOTST" 

) ADVANCE  EPC.  INCREMENTING  REGl  ON  INSTl. 
I DECREMENT  I NG  REGl  ON  SOUGHT  INSTRUCTION 
KCONTAINEO  IN  INST2 ) « 

JOUIT  WHEN  REGl  IS  LESS  THAN  0 
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